Make sure to check out my twitter feed for monthly promo codes and other updates (@easystats3).
The Essential Guide to Data Analytics with Stata
Learning and applying new statistical techniques can be daunting experience.
This is especially true once one engages with “real life” data sets that do not allow for easy “click-and-go” analysis, but require a deeper level of understanding of programme coding, data manipulation, output interpretation, output formatting and selecting the right kind of analytical methodology.
Make sure to check out my twitter feed for monthly promo codes and other updates (@easystats3).
The Essential Guide to Data Analytics with Stata
Learning and applying new statistical techniques can be daunting experience.
This is especially true once one engages with “real life” data sets that do not allow for easy “click-and-go” analysis, but require a deeper level of understanding of programme coding, data manipulation, output interpretation, output formatting and selecting the right kind of analytical methodology.
In this course you will receive a comprehensive introduction to Stata and its various uses in modern data analysis. You will learn to understand the many options that Stata gives you in manipulating, exploring, visualizing and modelling complex types of data. By the end of the course you will feel confident in your ability to engage with Stata and handle complex data analytics. The focus of each session will consistently be on creating a “good practice” and emphasising the practical application – and interpretation – of commonly used statistical techniques without resorting to deep statistical theory or equations.
This course will focus on providing an overview of data analytics using Stata.
No prior engagement with is Stata needed. Some prior statistics knowledge will help but is not necessary.
The course is aimed at anyone interested in data analytics using Stata.
Like for other professional statistical packages the course focuses on the proper application - and interpretation - of code.
Some basic quantitative/statistical knowledge will be required; this is not an introduction to statistics course but rather the application and interpretation of such using Stata.
Topics covered include:
Getting started with Stata
Viewing and exploring data
Manipulating data
Visualising data
Correlation and ANOVA
Regression including diagnostics (Ordinary Least Squares)
Regression model building
Hypothesis testing
Binary outcome models (Logit and Probit)
Fractional response models (Fractional Logit and Beta Regression)
Categorical choice models (Ordered Logit and Multinomial Logit)
Simulation techniques (Random Numbers and Simulation)
Count data models (Poisson and Negative Binomial Regression)
Survival data analysis (Parametric, Cox-Proportional Hazard and Parametric Survival Regression)
Panel data analysis (Long Form Data, Lags and Leads, Random and Fixed Effects, Hausman Test and Non-Linear Panel Regression)
Difference-in-differences analysis (Difference-in-Difference and Parallel Trends)
Instrumental variable regression (Endogenous Variables, Sample Selection, Non-Linear Endogenous Models)
Epidemiological tables (Cohort Studies, Case-Control Studies and Matched Case-Control Studies)
Power analysis (Sample Size, Power Size and Effect Size)
Matrix operations (Matrix operators, Matrix functions, Matrix subscripting)
Learn about the basic Stata interface. What does each window do and how do you interact with them?
If there's ONE thing you need to know how to use it's how to use help. Once you can use help you can "help" yourself. Learn about the different ways you can find help inside and outside Stata.
This course is taught via the command syntax (code). Professionals do not use 'point-and-click' to do their work but write all their analysis in code. Learn about the basic Stata syntax and how it works.
Stata is modular and uses something called .do and .ado files to operate. Learn how these are related to Stata code and how you can interact with them.
Log files save all your work to a continuously updated log file. This keeps a record of your results and code in case something goes wrong. Learn how to start, end and view a log file in Stata.
Not all data comes in Stata's .dta format. Learn how to import other data types in Stata such as excel spreadsheets.
Unlikes Excel, Stata does not display the underlying raw data by default. Learn how to access and view the raw data and how to modify it.
Basic data analysis often requires summary statistics such as means, standard deviation and min/max values. Learn how to get a overview of your data and produce basic summary statistics.
Not all data is suited for means and standard deviation analysis. Some data requires tabulation or tables. Learn more about how to tabulate data and how to create custom tables in Stata.
Not all data is 100% perfect. Sometimes there are missing values. Learn how Stata handles missing values and how to detect missing data.
Analysing distributions is an important part of statistical analysis. Learn what commands are available to analyse data distributions numerical in Stata.
Survey data often contains weights that give some observations more or less significance. Learn how to use weights in statistical analysis in Stata.
Learn how to recode an existing variables with the recode command. This command is fast and easy to use and allows you to recode numerical values quickly.
A limitation of the recode command is that is has trouble using "if" conditions in data manipulation. Learn how to use the generate and replace command to generate new and replace existing variable using "if" conditions.
New data often requires renaming and labelling to avoid confusion. Learn how to attach labels and rename variables.
If you are looking for a very complex data manipulation then the EGEN command may help. This command offers more complex extensions to generate and allows you to perform complex data manipulations.
Indicator/Categorical data is an important data type. Learn how to construct and recode categorical variables.
Not all data needs to be retained. Learn how to remove (or keep) selected observations and variables in your data.
Learn how to set the file path and save your data in Stata. Learn how to export your data in other formats.
String data (data that contains non-numeric characters) can be challenging to deal with. Get a quick introduction on how you can format such data to be useable in data analysis.
Some datasets are split into multiple files. Learn how to merge and append different datasets together to create larger and more complex datasets.
Advanced users use custom macro's to speed up their workflow. Get an introduction to macro's and learn the basics of looping code over your data.
Learn the basics of graphing in Stata. How to load, save and use graphs and how to change their colour scheme.
Bar charts and dot charts are useful to plot summary statistics over categorical data. Learn how to create bar and dot charts in Stata and what options are most relevant.
Examining distributions visually is an important part of data analysis. Learn how to use histograms and kernel density plots in Stata.
Pie charts are used to display numerical proportions. Learn how to create and customise pie charts in Stata.
Scatterplots allow users to analysis bivariate relationships. Learn how to create scatterplots and overlay lines of best fit to analyse relationships between two variables visually.
Remember that old graphical calculator you used in school that could could create custom functions? Learn how to draw custom functions (e.g. y = 2x^2) in Stata. Learn how this might be useful in a regression scenario.
Contour plots allow 3-dimensional visualisation in 2-dimensions via contours. Learn how to use and apply contour plots to Statistical analysis in Stata. These can be useful to visualise complicated interaction effects.
Sometimes you need to fake a scatterplot, but retain the original structure. Learn how to use jitter to randomly distribute actual datapoint and why this might be useful
Learn how to combine multiple graphs into one graph. This can be a powerful way to increase the visual effectiveness of your work. Learn what options you need to know about when combining graphs in Stata.
Learn how to test the association (relationship) between two categorical variables in Stata using the tabulate command.
Learn how to perform basic means test across two variables. Is the mean of one variable really different to the mean of another variables? Learn how to perform group mean tests.
Learn how to perform bivariate correlation (Pearson's correlation) on two more variables.
A short introduction to basic ANOVA analysis in Stata.
A brief introduction and example of basic OLS regression in Stata.
Learn how to integrate categorical variables into your regression analysis in Stata.
Learn how to perform common diagnostic analysis and tests after a regression in Stata.
Learn how to run and interpret a log transformed regression. Learn how to use interaction terms in a regression.
Learn how to use the test command to perform hypothesis testing after a regression.
Learn how to output the results from a regression to other programmes such as Word or Excel and how to make your results look professional.
When a dependent variable is binary it requires a logit/probit regression model for analysis. Learn how to apply such models in Stata.
Learn basic diagnostic tools after a login/probit regression model including goodness-of-fit statistics.
Ranked or unranked categorical dependent variables require more complex non-linear models. Learn the basics of the ordered logit and multinomial legit regression models in Stata.
Learn about simulation and how to create basic random numbers in Stata.
Learn how we can use random variables to create fake data with known properties.
Learn how we can test the assumptions of a regression estimator with fake data that has properties that do not match the requirements of the estimator. What happens when our data misbehaves?
Learn about Monte Carlo Simulation and how professionals test the statistical properties of estimation procedures.
OpenCourser helps millions of learners each year. People visit us to learn workspace skills, ace their exams, and nurture their curiosity.
Our extensive catalog contains over 50,000 courses and twice as many books. Browse by search, by topic, or even by career interests. We'll match you to the right resources quickly.
Find this site helpful? Tell a friend about us.
We're supported by our community of learners. When you purchase or subscribe to courses and programs or purchase books, we may earn a commission from our partners.
Your purchases help us maintain our catalog and keep our servers humming without ads.
Thank you for supporting OpenCourser.