Friday, March 22, 2019

Intro to SQL / No SQL / Spark for Data Science NYC

What you'll learn in this sql class:

2 hours hands on interactive in person SQL course in heart of NYC.

Data pulling and organization is a big effort in Data Science. Data exists in variety of Data source and the hardest part is to create a final Data Frame to run Data Science Algorithms. 

Most of job today requires Data Science Teams to get data from various sources. Creating data lakes is picking space but at this moment the the velocity of new data is so fast that companies expect Data Scientists to access data from variety of sources. Getting muddled up in Machine Learning Math sometimes leads new learners no chance to delve into data engineering aspects. 

Topics:
  • Jupyter Notebook
  • Connect MySQL to Python
  • Connect SQLite to Python
  • Push Dataframe to SQL - SQL to Dataframe
  • Push MongoDB JSON to Pandas - Pandas to MongoDB
  • PySpark for Unstructured Data & running Map reduce
  • Learning MLib for running Machine Learning
  • Scraping data from Beautiful soup
This is an optimal course for people who wants to get deeper in Data engineers aspect of Data Science. 

You will be able to answer the below questions after you take this session:
  • How would I create a data science policy to utilize SQL and NoSQL Databases?
  • Should I use Hadoop in my current data?
  • How to connect Python to MySQL to improve intra team collaboration and putting my machine learning algo into an app?
  • How to parse JSON data or data from API?
  • How to use Pyspark to take care of unstructured data in Python?
Our instructor has extensive experience in delivering data science and data engineering courses in New York City.

No comments:

Post a Comment

Note: Only a member of this blog may post a comment.