Practical SQL 2e (Sample Chapter) © 2021 by Anthony DeBarros
3
B E G I N N I N G DATA
E X P L O R AT I O N W I T H S E L E C T
For me, the best part of digging into data
isn’t the prerequisites of gathering, loading,
or cleaning the data, but when I actually get
to interview the data. Those are the moments
when I discover whether the data is clean or dirty,
whether it’s complete, and, most of all, what story the
data can tell. Think of interviewing data as a process
akin to interviewing a person applying for a job. You
want to ask questions that reveal whether the reality
of their expertise matches their résumé.
Interviewing the data is exciting because you discover truths. For
e xample, you might find that half the respondents forgot to fill out the
email field in the questionnaire, or the mayor hasn’t paid property taxes
for the past five years. Or you might learn that your data is dirty: names are
spelled inconsistently, dates are incorrect, or numbers don’t jibe with your
expectations. Your findings become part of the data’s story.