- The importance of learning on your own
- The importance of communication
- Functions and relations
- Every table represents a relation.
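The last two points above can be made concrete with a small sketch. It treats a two-column table as a set of (first, second) pairs, which is one way to view a table as a relation, and checks whether that relation is also a function. The table names, the data, and the helper `is_function` are made up for illustration.

```python
# A two-column table stored as a set of rows; each row is one pair in the relation.
# Hypothetical data: which courses each student is enrolled in.
enrollment = {
    ("ana", "algebra"),
    ("ana", "biology"),
    ("ben", "algebra"),
}

# Hypothetical data: each student's single advisor.
advisor = {
    ("ana", "dr_lee"),
    ("ben", "dr_kim"),
}


def is_function(relation):
    """Return True if no first element is paired with two different second elements."""
    seen = {}
    for x, y in relation:
        if x in seen and seen[x] != y:
            return False  # x maps to two different values, so not a function
        seen[x] = y
    return True


print(is_function(enrollment))  # "ana" appears with two courses
print(is_function(advisor))     # each student has one advisor
```

Both tables are relations, but only `advisor` is a function, because `enrollment` pairs one student with more than one course.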
From Chapter 3 - Jupyter
- The structure of Jupyter
- How to shut down Jupyter
- Writing to a slice of a DataFrame
- Explanations before and after code
- The relationship between tall and wide data
- The value of abstraction in programming
- Why people use tools like git
- Vectorization and its benefits
- Models vs. fit models
- Visualizing relations vs. functions
- Informally, map is the same as apply
- Important phrases: map-reduce and split-apply-combine
- Concat adds rows and merge adds columns (usually!)
- Munging/ETL is a large portion of data work
- Information = Data + Context
- Summary of key points about missing values
- Uses for data dashboards
- A graph depicts a binary relation of a set with itself
- How pivoting/melting impacts graph data
- What is a recommender system?
- The SVD and approximation
- Supervised vs. unsupervised machine learning
- A central issue: overfitting vs. underfitting
- Why we split data into train and test sets
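As a minimal sketch of the final point, the split can be written with only the standard library. The idea is that a model is fit on the training rows alone, and the held-out test rows estimate how it behaves on data it has not seen, which is how overfitting shows up. The function name `split_rows`, the 25% test fraction, and the seed are assumptions chosen for illustration, not a prescribed recipe.

```python
import random


def split_rows(rows, test_fraction=0.25, seed=0):
    """Shuffle a copy of the rows and hold out a fraction of them as a test set."""
    rng = random.Random(seed)      # fixed seed so the split is reproducible
    shuffled = list(rows)          # copy, so the caller's data is not reordered
    rng.shuffle(shuffled)
    n_test = round(len(shuffled) * test_fraction)
    # The first n_test shuffled rows become the test set; the rest are for training.
    return shuffled[n_test:], shuffled[:n_test]


rows = list(range(20))             # stand-in for 20 rows of a data set
train, test = split_rows(rows)
print(len(train), len(test))
```

Every row lands in exactly one of the two sets, so the test rows never influence the fit.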