1 00:00:00,000 --> 00:00:09,934 [MUSIC] 2 00:00:09,934 --> 00:00:12,592 Howdy data analysts, my name is AJ. 3 00:00:12,592 --> 00:00:15,578 And I am an instructor at Treehouse. 4 00:00:15,578 --> 00:00:18,910 My pronouns include he and they. 5 00:00:18,910 --> 00:00:23,782 I'm excited to introduce you to a powerful data visualization library called 6 00:00:23,782 --> 00:00:25,470 Seaborn. 7 00:00:25,470 --> 00:00:29,866 Before we get started, make sure you have completed the prerequisite material for 8 00:00:29,866 --> 00:00:31,410 this course. 9 00:00:31,410 --> 00:00:34,540 You can find them in the teacher's notes below. 10 00:00:34,540 --> 00:00:38,862 Seaborn is a rich data visualization library that is built on 11 00:00:38,862 --> 00:00:41,782 top of the plotting library Matplotlib. 12 00:00:41,782 --> 00:00:48,290 It offers a set of high level tools for creating statistical charts and plots. 13 00:00:48,290 --> 00:00:52,656 It is more convenient than Matplotlib for quickly visualizing data, 14 00:00:52,656 --> 00:00:57,570 because it integrates well with Pandas DataFrame objects. 15 00:00:57,570 --> 00:01:01,888 Recall that Pandas is a popular data analysis library. 16 00:01:01,888 --> 00:01:09,011 And that a DataFrame represents tabular data, like what you would find in a table, 17 00:01:09,011 --> 00:01:13,617 spreadsheet, or a comma separated values CSV file. 18 00:01:13,617 --> 00:01:16,776 Seaborn will work with Pandas DataFrames and 19 00:01:16,776 --> 00:01:22,520 convert them under the hood into code that Matplotlib can understand. 20 00:01:22,520 --> 00:01:28,833 And Seaborn boasts a simplified interface compared to Matplotlib, making it easier 21 00:01:28,833 --> 00:01:34,910 to create a wide variety of high quality plots with fewer lines of code. 22 00:01:34,910 --> 00:01:40,689 In this introductory course, we will go over three different families of plots 23 00:01:40,689 --> 00:01:46,400 built into Seaborn: relational, distribution, and categorical plots. 24 00:01:48,250 --> 00:01:53,776 Relational plots are used for highlighting the relationship between two variables. 25 00:01:53,776 --> 00:01:58,110 In Seaborn, these are scatter plots and line plots. 26 00:01:58,110 --> 00:02:02,251 Distribution plots make it easy to see the shape of a data set, and 27 00:02:02,251 --> 00:02:06,370 help understand how its values are distributed. 28 00:02:06,370 --> 00:02:09,540 These include histograms and kernel density estimates. 29 00:02:10,910 --> 00:02:15,471 Seaborn has three sub families of categorical plots which help explore 30 00:02:15,471 --> 00:02:16,750 categorical data. 31 00:02:18,310 --> 00:02:23,200 Categorical data means variables that are words instead of numbers. 32 00:02:23,200 --> 00:02:27,440 Seaborn has categorical scatter plots, including strip and 33 00:02:27,440 --> 00:02:32,400 swarm plots, distribution plots, which include box and violin plots, 34 00:02:32,400 --> 00:02:36,490 and estimation plots, which include bar and count plots. 35 00:02:37,980 --> 00:02:42,550 Note that Seaborn includes more plots than we will go over during this course. 36 00:02:43,700 --> 00:02:50,807 The Seaborn API documentation and the tutorial are excellent starting points for 37 00:02:50,807 --> 00:02:55,998 getting to know all the different kinds of available plots. 38 00:02:55,998 --> 00:03:00,478 You can also take a look at the many additionally customized 39 00:03:00,478 --> 00:03:02,905 charts in the Seaborn gallery. 40 00:03:02,905 --> 00:03:03,867 In this course, 41 00:03:03,867 --> 00:03:08,423 I will guide you through an overview of different plots available in Seaborn. 42 00:03:08,423 --> 00:03:12,834 Then I will be using data from two Japanese games, 43 00:03:12,834 --> 00:03:18,098 Pokemon and Yugioh to perform exploratory data analysis. 44 00:03:18,098 --> 00:03:23,791 I will have examples and challenges for you based on these imported data sets. 45 00:03:23,791 --> 00:03:24,911 Are you ready? 46 00:03:24,911 --> 00:03:27,724 In the next series of instructions, 47 00:03:27,724 --> 00:03:31,451 we will begin with an overview of Seaborn plots. 48 00:03:31,451 --> 00:03:33,330 I'll catch you on the other side!