1.
How to Learn About Programming or Software
Engineering (2-3)
- Common data structures (data types, lists, dictionaries,
sets, tuples), writing functions, logic, control flow,
searching and sorting algorithms, object-oriented
programming, and working with external libraries.
- SQL scripting: Querying databases using joins,
aggregations, and subqueries → Leetcode
- Comfort using the Terminal, version control in Git, and
using GitHub
2. How to Learn About Data Collection and Wrangling
(Cleaning) (2)
- Collect data from a
Scripting - extracting data from the website, APIS, DBs:
website/API (open for public consumption) of your
choice, and transform the data to store it from different
sources into an aggregated file or table (DB). Example
APIs include TMDB, quandl, Twitter API, and so on.
- Pick any publicly available dataset and define a set of
questions that you’d want to pursue after looking at the
dataset and the domain. Wrangle the data to find out
answers to those questions using Pandas and NumPy.
- Data formatting
- Using library
- Data transformation
- Handle missing values
3. How to Learn About Exploratory Data Analysis,
Business Acumen, and Storytelling (2-3)
Specific exploratory data analysis and storytelling topics to learn include:
● Exploratory data analysis — defining questions,
handling missing values, outliers, formatting, filtering,
univariate and multivariate analysis.
● Data visualization — plotting data using libraries like
matplotlib, seaborn, and plotly. Know how to choose the
right chart to communicate the findings from the data.
● Developing dashboards — a good percent of analysts
only use Excel or a specialized tool like Power BI and
Tableau to build dashboards that summarise/aggregate
data to help management make decisions.
● Business acumen: Work on asking the right questions to
answer, ones that actually target the business metrics.
Practice writing clear and concise reports, blogs, and
presentations.
4. vds
5. c
6.