Site icon QUE.com

Introduction to Data Analytics

Data is the new Gold in the eyes of any entrepreneurs and business owners out there. This is where you get information to support your next strategic plan to improve your business productivity.

As I continue to learn this myself, I would like to share few things to remember in Data Analytics a 7 steps workflow.

  1. Business Problem. You need to it first so you can plan your attack of action.
  2. Data Acquisition. You need to have some data to create your baseline of execution. This is where you set your start your baseline and measure results (gaining or losing).
  3. Data Wrangling. You need your Data Scientist to help sanitize your data to bring the value out of it.
  4. Exploratory Data Analysis (EDA). I first learn this keyword while learning Data Science at Simplilearn. I first see it as part of work flow of Data Wrangling. EDA studies the data to recommend models that best fit the data.
  5. Data Exploration.
  6. Conclusion or prediction. This is where we produce the reports.
  7. Communication. Sharing the report to the management.

Useful tools for EDA Graphical Technique. Histogram and Scatter Plots are two popular graphical techniques to depict data.

Histogram graphically summarizes the distribution of a univariate datasets, it shows:

Scatter plot represent relationship between two variables, and provides answers these questions visually.

Data Types of Plotting.

Data Analytics – Skills and Tools

Questions or Business Problem

  • Ability to ask appropriate questions and know the business
  • Domain Knowledge
  • Passion for Data
  • Analytical approach

Data Acquisition

  • Beautiful Soup for web scraping
  • CSV or other file knowledge
  • NumPy
  • Pandas
  • Database

Data Wrangling

  • CSV or other file knowledge
  • NumPy
  • Pandas
  • Database
  • SciPy

Data Exploration

  • NumPy
  • SciPy
  • Pandas
  • Matplotlib

Conclusion or Predictions

  • Scikit-Learn, the machine learning library
  • CSV or other file knowledge
  • NumPy
  • Pandas
  • Database
  • SciPy

Communication or Data Visualization

  • Pandas
  • Database
  • Matplotlib
  • PPT
  • CSV or other file knowledge

Key Takeaways to remember.

Interested to learn more, continue reading articles in Artificial Intelligence, Machine Learning and Data Science..

Featured image by xresch

Exit mobile version