This course will be retired on January 8, 2022.
Duplicated Data (9:02) with Alyssa Batula
Sometimes a dataset has the same information recorded multiple times. This duplicated data can cause problems with our analysis, for example by giving repeated records extra weight in counts and averages.
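As a rough illustration of spotting and removing duplicated rows, here is a minimal pandas sketch. The DataFrame and its column names (`id`, `weight_kg`, `height_cm`) are made up for this example and are not taken from the NHANES codebook. It treats a row as a duplicate only when every column matches an earlier row, which may not be the right rule for every dataset.

```python
import pandas as pd

# Hypothetical body-measurement records; the column names are illustrative
# and do not come from the NHANES documentation.
df = pd.DataFrame({
    "id": [1, 2, 2, 3],
    "weight_kg": [70.5, 82.0, 82.0, 65.3],
    "height_cm": [172.0, 180.5, 180.5, 160.2],
})

# duplicated() flags every row that exactly repeats an earlier row.
is_dup = df.duplicated()
n_duplicates = int(is_dup.sum())

# drop_duplicates() keeps the first occurrence of each repeated row.
deduped = df.drop_duplicates().reset_index(drop=True)

# Compare a summary statistic before and after removing the repeats.
mean_before = df["weight_kg"].mean()
mean_after = deduped["weight_kg"].mean()

print(f"Duplicate rows found: {n_duplicates}")
print(f"Rows after removal: {len(deduped)}")
print(f"Mean weight before: {mean_before:.2f}, after: {mean_after:.2f}")
```

If only some columns should define a duplicate (for instance, a participant identifier), `duplicated` and `drop_duplicates` both accept a `subset` argument listing those columns.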
- Outlier - An example that is highly different from the majority of examples.
Data Source Information:
National Health and Nutrition Examination Survey (NHANES) main site
NHANES 1999-2000 information
Body Measurements Documentation/Codebook
Python and Pandas Resources: