Data Science

Uploaded by

mmonisha2201

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

18 views4 pages

Data Science

Uploaded by

mmonisha2201

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 4

Data science

The existence of Comet NEOWISE (here depicted as a series of red dots) was
discovered by analyzing astronomical survey data acquired by a space telescope,
the Wide-field Infrared Survey Explorer.
Data science is an interdisciplinary academic field[1] that uses statistics, scientific
computing, scientific methods, processing, scientific visualization, algorithms and
systems to extract or extrapolate knowledge from potentially noisy, structured,
or unstructured data.[2]

Data science also integrates domain knowledge from the underlying application domain
(e.g., natural sciences, information technology, and medicine).[3] Data science is
multifaceted and can be described as a science, a research paradigm, a research
method, a discipline, a workflow, and a profession.[4]

Data science is "a concept to unify statistics, data analysis, informatics, and their
related methods" to "understand and analyze actual phenomena" with data.[5] It uses
techniques and theories drawn from many fields within the context of mathematics,
statistics, computer science, information science, and domain knowledge.[6] However,
data science is different from computer science and information science. Turing
Award winner Jim Gray imagined data science as a "fourth paradigm" of science
(empirical, theoretical, computational, and now data-driven) and asserted that
"everything about science is changing because of the impact of information technology"
and the data deluge.[7][8]

A data scientist is a professional who creates programming code and combines it with
statistical knowledge to summarize data.[9]

Foundations
Data science is an interdisciplinary field[10] focused on extracting knowledge from
typically large data sets and applying the knowledge from that data to solve problems in
other application domains. The field encompasses preparing data for analysis,
formulating data science problems, analyzing data, and summarizing these findings. As
such, it incorporates skills from computer science, mathematics, data
visualization, graphic design, communication, and business.[11]

Vasant Dhar writes that statistics emphasizes quantitative data and description. In
contrast, data science deals with quantitative and qualitative data (e.g., from images,
text, sensors, transactions, customer information, etc.) and emphasizes prediction and
action.[12] Andrew Gelman of Columbia University has described statistics as a non-
essential part of data science.[13] Stanford professor David Donoho writes that data
science is not distinguished from statistics by the size of datasets or use of computing
and that many graduate programs misleadingly advertise their analytics and statistics
training as the essence of a data-science program. He describes data science as an
applied field growing out of traditional statistics.[14]
Etymology
Early usage
In 1962, John Tukey described a field he called "data analysis", which resembles
modern data science.[14] In 1985, in a lecture given to the Chinese Academy of Sciences
in Beijing, C. F. Jeff Wu used the term "data science" for the first time as an alternative
name for statistics.[15] Later, attendees at a 1992 statistics symposium at the University
of Montpellier II acknowledged the emergence of a new discipline focused on data of
various origins and forms, combining established concepts and principles of statistics
and data analysis with computing.[16][17]

The term "data science" has been traced back to 1974, when Peter Naur proposed it as
an alternative name to computer science.[6] In 1996, the International Federation of
Classification Societies became the first conference to specifically feature data science
as a topic.[6] However, the definition was still in flux. After the 1985 lecture at the
Chinese Academy of Sciences in Beijing, in 1997 C. F. Jeff Wu again suggested that
statistics should be renamed data science. He reasoned that a new name would help
statistics shed inaccurate stereotypes, such as being synonymous with accounting or
limited to describing data.[18] In 1998, Hayashi Chikio argued for data science as a new,
interdisciplinary concept, with three aspects: data design, collection, and analysis.[17]

Modern usage
In 2012, technologists Thomas H. Davenport and DJ Patil declared "Data Scientist: The
Sexiest Job of the 21st Century",[19] a catchphrase that was picked up even by major-city
newspapers like the New York Times[20] and the Boston Globe.[21] A decade later, they
reaffirmed it, stating that "the job is more in demand than ever with employers". [22]

The modern conception of data science as an independent discipline is sometimes

attributed to William S. Cleveland.[23] In 2014, the American Statistical Association's
Section on Statistical Learning and Data Mining changed its name to the Section on
Statistical Learning and Data Science, reflecting the ascendant popularity of data
science.[24]

The professional title of "data scientist" has been attributed to DJ Patil and Jeff
Hammerbacher in 2008.[25] Though it was used by the National Science Board in their
2005 report "Long-Lived Digital Data Collections: Enabling Research and Education in
the 21st Century", it referred broadly to any key role in managing a digital data
collection.[26]

Data science and data analysis

Example for the usefulness of exploratory data
analysis as demonstrated using the Datasaurus dozen data set
Data analysis typically involves working with structured datasets to answer specific
questions or solve specific problems. This can involve tasks such as data
cleaning and data visualization to summarize data and develop hypotheses about
relationships between variables. Data analysts typically use statistical methods to test
these hypotheses and draw conclusions from the data.[27]

Data science involves working with larger datasets that often require advanced
computational and statistical methods to analyze. Data scientists often work
with unstructured data such as text or images and use machine learning algorithms to
build predictive models. Data science often uses statistical analysis, data
preprocessing, and supervised learning.[28][29]

Cloud computing for data science

A cloud-based architecture for enabling big data

analytics. Data flows from various sources, such as personal computers, laptops,
and smart phones, through cloud services for processing and analysis, finally leading to
various big data applications.
Cloud computing can offer access to large amounts of computational power
and storage.[30] In big data, where volumes of information are continually generated and
processed, these platforms can be used to handle complex and resource-intensive
analytical tasks.[31]
Some distributed computing frameworks are designed to handle big data workloads.
These frameworks can enable data scientists to process and analyze large datasets in
parallel, which can reduce processing times.[32]

Ethical consideration in data science

Data science involves collecting, processing, and analyzing data which often includes
personal and sensitive information. Ethical concerns include potential privacy violations,
bias perpetuation, and negative societal impacts.[33][34]

FODS Unit 1 Fully
No ratings yet
FODS Unit 1 Fully
30 pages
Data Science Master
100% (1)
Data Science Master
2 pages
Introduction To Data Science Practical Approach With R and Python (B. Uma Maheswari, R. Sujatha) (Z-Library) - 8-28
No ratings yet
Introduction To Data Science Practical Approach With R and Python (B. Uma Maheswari, R. Sujatha) (Z-Library) - 8-28
21 pages
Data Science: Evolution and Impact
100% (2)
Data Science: Evolution and Impact
4 pages
Data Science
No ratings yet
Data Science
9 pages
Data Science
No ratings yet
Data Science
7 pages
Data Science Sample
No ratings yet
Data Science Sample
2 pages
Data Science
No ratings yet
Data Science
3 pages
Data Science
No ratings yet
Data Science
5 pages
Data Science Basics
No ratings yet
Data Science Basics
5 pages
Data Science-2
No ratings yet
Data Science-2
3 pages
Data Science Basics and History
100% (1)
Data Science Basics and History
51 pages
Data Science Intro
No ratings yet
Data Science Intro
6 pages
Data Science
No ratings yet
Data Science
1 page
Data Science - Wikipedia
No ratings yet
Data Science - Wikipedia
7 pages
Chirag Modi Data Science Report
No ratings yet
Chirag Modi Data Science Report
29 pages
23STUCHH010864
No ratings yet
23STUCHH010864
24 pages
FDS - Lecture Notes - III AIML, CSM
No ratings yet
FDS - Lecture Notes - III AIML, CSM
101 pages
History and Evolution of Data Science
No ratings yet
History and Evolution of Data Science
2 pages
Data Science
No ratings yet
Data Science
9 pages
Data Science
No ratings yet
Data Science
46 pages
Data Science: Evolution & Impact
No ratings yet
Data Science: Evolution & Impact
3 pages
Data Science - Wikipedia
No ratings yet
Data Science - Wikipedia
6 pages
The Transformative Role of Data Science in Contemporary Society
No ratings yet
The Transformative Role of Data Science in Contemporary Society
14 pages
A
No ratings yet
A
4 pages
PSAI Unit 1
No ratings yet
PSAI Unit 1
70 pages
Module 1 Introduction To DataScience and Analytics
No ratings yet
Module 1 Introduction To DataScience and Analytics
10 pages
DataScience Intro
No ratings yet
DataScience Intro
36 pages
Introduction to Data Science Course
No ratings yet
Introduction to Data Science Course
44 pages
Data Science Overview & Applications
No ratings yet
Data Science Overview & Applications
17 pages
Vickie Data Analytics
No ratings yet
Vickie Data Analytics
9 pages
Data Science Hype and Reality
No ratings yet
Data Science Hype and Reality
7 pages
Intro Lectures To DSA
0% (1)
Intro Lectures To DSA
17 pages
Ilovepdf Merged
No ratings yet
Ilovepdf Merged
20 pages
iMY DATA SCIENCE - Removed
No ratings yet
iMY DATA SCIENCE - Removed
19 pages
Data Science and Its Importance
No ratings yet
Data Science and Its Importance
9 pages
Data Science Notes - 1-PD
No ratings yet
Data Science Notes - 1-PD
17 pages
Data Science A Beginner S Guide 1668243666
100% (1)
Data Science A Beginner S Guide 1668243666
26 pages
Data Science
No ratings yet
Data Science
11 pages
Unit-1 Data Science
No ratings yet
Unit-1 Data Science
74 pages
Data Science Overview & Applications
No ratings yet
Data Science Overview & Applications
10 pages
Data Science: A Systematic Overview
No ratings yet
Data Science: A Systematic Overview
11 pages
TMP3413 Software Engineering Lab: Defining The Requirements
No ratings yet
TMP3413 Software Engineering Lab: Defining The Requirements
18 pages
What Is Data Science?: Michael L. Brodie
No ratings yet
What Is Data Science?: Michael L. Brodie
21 pages
Data Science vs. Statistics: Two Cultures?
No ratings yet
Data Science vs. Statistics: Two Cultures?
22 pages
1.1 Idml
No ratings yet
1.1 Idml
3 pages
OceanofPDF - Com DATA SCIENCE Simple and Effective Tips An - Benjamin Smith
100% (1)
OceanofPDF - Com DATA SCIENCE Simple and Effective Tips An - Benjamin Smith
122 pages
Data Science Unit 1
No ratings yet
Data Science Unit 1
24 pages
Ex 7
No ratings yet
Ex 7
2 pages
Cybersecurity Data Science Insights
No ratings yet
Cybersecurity Data Science Insights
27 pages
Introduction To Data Science Lecture 1
No ratings yet
Introduction To Data Science Lecture 1
4 pages
Final Seminar Report
100% (2)
Final Seminar Report
18 pages
IT General Controls Questionnaire
0% (1)
IT General Controls Questionnaire
6 pages
DW DM Notes
No ratings yet
DW DM Notes
107 pages
6822 Protecting Your Data With Windows 10 BitLocker
No ratings yet
6822 Protecting Your Data With Windows 10 BitLocker
6 pages
Ids Mod1
No ratings yet
Ids Mod1
21 pages
Carmichael MArron 2018 OJO
No ratings yet
Carmichael MArron 2018 OJO
22 pages
1) Data-Sci Chapter-1
No ratings yet
1) Data-Sci Chapter-1
17 pages
IDS Complete Notes
No ratings yet
IDS Complete Notes
126 pages
H3C CloudOS 5.0 CLoud Operation System Technical Withepaper - IaaS
No ratings yet
H3C CloudOS 5.0 CLoud Operation System Technical Withepaper - IaaS
53 pages
Google Cloud Logging Insights
No ratings yet
Google Cloud Logging Insights
64 pages
Intro to Data Science Basics
No ratings yet
Intro to Data Science Basics
18 pages
Juniper
No ratings yet
Juniper
68 pages
Ch7-Overview of Data Science-Part 1
No ratings yet
Ch7-Overview of Data Science-Part 1
37 pages
Data Collection and Preparation Exploratory Data Analysis (EDA) Machine Learning Data Visualization Model Deployment and Evaluation
No ratings yet
Data Collection and Preparation Exploratory Data Analysis (EDA) Machine Learning Data Visualization Model Deployment and Evaluation
10 pages
WINSEM2024-25 BITE401L TH VL2024250503090 2024-12-13 Reference-Material-I
No ratings yet
WINSEM2024-25 BITE401L TH VL2024250503090 2024-12-13 Reference-Material-I
35 pages
Embedded System Development Coding Reference Guide
100% (2)
Embedded System Development Coding Reference Guide
190 pages
Institute of Accountancy Arusha (IAA)
100% (1)
Institute of Accountancy Arusha (IAA)
23 pages
Security Analyst Resume
No ratings yet
Security Analyst Resume
2 pages
Arch Linux - Wikipedia
No ratings yet
Arch Linux - Wikipedia
5 pages
Data Science
No ratings yet
Data Science
5 pages
Furniture Shop Management System Project Report
100% (1)
Furniture Shop Management System Project Report
54 pages
Grade 12 Unit 3
No ratings yet
Grade 12 Unit 3
26 pages
COM Wrapper Tutorial for Custom Objects
No ratings yet
COM Wrapper Tutorial for Custom Objects
27 pages
Fiona
No ratings yet
Fiona
83 pages
DevOps Engineer Master's Program
No ratings yet
DevOps Engineer Master's Program
22 pages
En Raccoon Stealer Technical Analysis Report
No ratings yet
En Raccoon Stealer Technical Analysis Report
28 pages
300-710 Prepaway Premium Exam 76q
No ratings yet
300-710 Prepaway Premium Exam 76q
24 pages
Comprehensive Test Plan Guide
No ratings yet
Comprehensive Test Plan Guide
14 pages
97 Burp Suite Top 5 Community Edition Extensions
No ratings yet
97 Burp Suite Top 5 Community Edition Extensions
3 pages
PaloAlto Comparacao PDF
No ratings yet
PaloAlto Comparacao PDF
5 pages
CV - Muhroji Sutio
No ratings yet
CV - Muhroji Sutio
2 pages
Que 1. What Is Python?: 1) Easy To Learn and Use
No ratings yet
Que 1. What Is Python?: 1) Easy To Learn and Use
11 pages
HTML Meta Tags Guide
No ratings yet
HTML Meta Tags Guide
9 pages
Annex D: (Informative)
No ratings yet
Annex D: (Informative)
4 pages
Access Control Modes
No ratings yet
Access Control Modes
2 pages
Ignition User Manual
100% (1)
Ignition User Manual
566 pages

Data Science

Uploaded by

Data Science

Uploaded by

Data science

The modern conception of data science as an independent discipline is sometimes

Data science and data analysis

Cloud computing for data science

A cloud-based architecture for enabling big data

Ethical consideration in data science

You might also like