About this course
Course Overview
Data Analysis is a process of applying statistical and mathematical techniques systematically to understand, explore, and analyze data to find patterns, and draw inferences that help businesses make data-driven decisions. This typically involves multiple activities such as data collection, exploration, cleaning, pre-processing, and organizing data. Many times, data analysis is an iterative ongoing process where the data is continuously collected and analyzed simultaneously. There are two primary methods for data analysis.
Qualitative techniques and quantitative techniques. Quantitative data analysis techniques involve working with quantitative/numerical data including statistics, percentages, and calculations. These techniques also include working with algorithms, mathematical analysis tools, and software to manipulate data and uncover hidden business value. For example, quantitative data analysis used to assess market data helps a company decide a price for its new product.
Qualitative data analysis involves, working with non-numerical data i.e categorical variables. Qualitative data analysis is also used in many business processes, such as identifying themes and patterns, answering research questions, etc to improve a product.
This course provides an overview of data concepts and what data analysis is and then deep dives into the fundamentals of Data Analysis such as statistics and probability. This course also focuses on widely used data analysis methods such as regression along with detailed steps to perform the same.
*Must have Microsoft Excel in order to complete class activities.
Target Audience
This data analysis course is perfect for anyone who needs an introduction to Data Analysis such as Business Analysts, Management, Engineers, DevelopersCourse Objectives
- Data Analysis process, benefits and use cases
- Basic of Probability and Statistics
- Measure of data spread and distributions
- Inferential Statistics and Hypothesis Testing
- Applications of Statistics and Probability theory
- Forecast trends using linear regression analysis
Course Content
- 1. All about Data
Data in the real world - A brief on various formats and sources of data
- 7 V's of Data
- Structured vs Unstructured vs Semi-Structured data
- Data processing types
Introduction to Data Analysis
- Need for Data Analysis
- Applications and Use Cases of Data Analysis
- Data Analysis Methodology
- Types of variables
- Numerical vs Categorial Variables
Descriptive Statistics
- Measures of Central Tendency
- Measures of Dispersion
- Data Skewness and Kurtosis
- Understanding Outliers
- Understanding missing values
- Role of Descriptive Statistics in Data Analysis
Inferential Statistics
- Population and Sample
- Statistics vs Parameters
Introduction to Probability
- Basics of Probability
- Axioms of Probability
- Conditional Probability and Bayes theorem
- 2. Applications of Conditional Probability
Understanding Probability Distributions - Discrete Probability Distributions
- Continuous Probability Distributions
- Performing Distributions in Excel
- Why understanding Data Distributions is important for Data Analysis
Data Analysis Process
- Understanding Covariance and Correlation
- Understanding univariate vs Bivariate vs Multi variate data analysis
- Understanding Regression
- Simple Linear Regression
- Multiple Linear Regression
- Exercise
Introduction to Predictive Analytics
- Exercise
Course Prerequisites
- Some familiarity with data terminologies
- MS Excel for Exercises