SkillAgentSearch skills...

DAT8

General Assembly's 2015 Data Science course in Washington, DC

Install / Use

/learn @justmarkham/DAT8

README

DAT8 Course Repository

Course materials for General Assembly's Data Science course in Washington, DC (8/18/15 - 10/29/15).

Instructor: Kevin Markham (Data School blog, email newsletter, YouTube channel)

Binder

Tuesday | Thursday --- | --- 8/18: Introduction to Data Science | 8/20: Command Line, Version Control 8/25: Data Reading and Cleaning | 8/27: Exploratory Data Analysis 9/1: Visualization | 9/3: Machine Learning 9/8: Getting Data | 9/10: K-Nearest Neighbors 9/15: Basic Model Evaluation | 9/17: Linear Regression 9/22: First Project Presentation | 9/24: Logistic Regression 9/29: Advanced Model Evaluation | 10/1: Naive Bayes and Text Data 10/6: Natural Language Processing | 10/8: Kaggle Competition 10/13: Decision Trees | 10/15: Ensembling 10/20: Advanced scikit-learn, Clustering | 10/22: Regularization, Regex 10/27: Course Review | 10/29: Final Project Presentation

<!-- ### Before the Course Begins * Install [Git](http://git-scm.com/downloads). * Create an account on the [GitHub](https://github.com/) website. * It is not necessary to download "GitHub for Windows" or "GitHub for Mac" * Install the [Anaconda distribution](http://continuum.io/downloads) of Python 2.7x. * If you choose not to use Anaconda, here is a list of the [Python packages](other/python_packages.md) you will need to install during the course. * We would like to check the setup of your laptop before the course begins: * You can have your laptop checked before the intermediate Python workshop on Tuesday 8/11 (5:30pm-6:30pm), at the [15th & K Starbucks](http://www.yelp.com/biz/starbucks-washington-15) on Saturday 8/15 (1pm-3pm), or before class on Tuesday 8/18 (5:30pm-6:30pm). * Alternatively, you can walk through the [setup checklist](other/setup_checklist.md) yourself. * Once you receive an email invitation from Slack, join our "DAT8 team" and add your photo. * Practice Python using the resources below. -->

Python Resources

<!-- ### Submission Forms * [Feedback form](http://bit.ly/dat8feedback) * [Homework and project submissions](http://bit.ly/dat8homework) -->

Course project

Comparison of machine learning models

Comparison of model evaluation procedures and metrics

Advice for getting better at data science

Additional resources


Class 1: Introduction to Data Science

Homework:

  • Work through GA's friendly command line tutorial using Terminal (Linux/Mac) or Git Bash (Windows).
  • Read through this command line reference, and complete the pre-class exercise at the bottom. (There's nothing you need to submit once you're done.)
  • Watch videos 1 through 8 (21 minutes) of Introduction to Git and GitHub, or read sections 1.1 through 2.2 of Pro Git.
  • If your laptop has any setup issues, please work with us to resolve them by Thursday. If your laptop has not yet been checked, you should come early on Thursday, or just walk through the setup checklist yourself (and let us know you have done so).

Resources:


Class 2: Command Line and Version Control

  • Slack tour
  • Review the command line pre-class exercise (code)
  • Git and GitHub (slides)
  • Intermediate command line

Homework:

Git and Markdown Resources:

  • Pro Git is an excellent book for learning Git. Read the first two chapters to gain a deeper understanding of version control and basic commands.
  • If you want to practice a lot of Git (and learn many more commands), Git Immersion looks promising.
  • If you want to understand how to contribute on GitHub, you first have to understand forks and pull requests.
  • GitRef is my favorite reference guide for Git commands, and [Git quick reference for beginners](http:

Related Skills

View on GitHub
GitHub Stars1.6k
CategoryData
Updated2d ago
Forks1.1k

Languages

Jupyter Notebook

Security Score

85/100

Audited on Apr 8, 2026

No findings