Final Project Resources
-
Python and Command Line Fundamentals
- Introduction to command line, resources for using the shell, and command line cheatsheet
- Setting up a virtual environment
-
Ian Milligan and James Baker, “Introduction to the Bash Command Line,” The Programming Historian 3 (2014), https://programminghistorian.org/en/lessons/intro-to-bash.
- Introduction to Python and additional introduction
- Python cheatsheet of data structures
- Python data flows and dh tools example
- Introduction to Python Classes and classes example
-
Web Scraping, HTML, and HTTPS
- Introduction to the html and web scraping
- In class example of BeautifulSoup and example of requests
- Introduction to HTTPS
- Jeri Wieringa, “Intro to Beautiful Soup,” The Programming Historian 1 (2012), https://programminghistorian.org/en/lessons/intro-to-beautiful-soup
-
Pandas and EDA
- Introduction to Pandas and accompanying notebook. Also intro to Jupyter notebook and notebook shortcuts
- Advanced Pandas resources and pandas web scraping notebook
- Introduction to cleaning data and EDA
- Wes McKinney. Python for Data Analysis. 2017. Chapters 2, 5 (Available online through the Princeton Library)
-
Text Analysis
- Introduction to text analysis and introduction to spacy, along with class notebook
- Identify NER script example
- Matt Lavin “Analyzing Documents with TF-IDF” https://programminghistorian.org/en/lessons/analyzing-documents-with-tfidf
- Zoë Wilkinson Saldaña “Sentiment Analysis for Exploratory Data Analysis” https://programminghistorian.org/en/lessons/sentiment-analysis
- François Dominic Laramée “Introduction to stylometry with Python” https://programminghistorian.org/en/lessons/introduction-to-stylometry-with-python
-
Data Visualizations and Publishing Charts
- Introduction to Altair and notebook, along with class exercises
- Publishing Altair charts