00 Dm2 Python Libraries4data Science 2020

The document lists the top 20 Python libraries for data science, categorized into core libraries, visualization, data mining, machine learning, deep learning, natural language processing, and data scraping. Key libraries include NumPy, Pandas, Matplotlib, Scikit-learn, TensorFlow, and NLTK, each serving specific functions like data manipulation, visualization, and machine learning. The document provides brief descriptions and links for further exploration of each library.

Uploaded by

sohail 32

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

4 views7 pages

00 Dm2 Python Libraries4data Science 2020

Uploaded by

sohail 32

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 7

Top 20 Python Libraries for

Data Science
Core Libraries & Statistics
NumPy (http://www.numpy.org/)
• It is intended for processing large multidimensional arrays and matrices, and
an extensive collection of high-level mathematical functions and implemented
methods makes it possible to perform various operations with these objects.
SciPy (https://scipy.org/scipylib/)
• It is based on NumPy and therefore extends its capabilities. SciPy main data
structure is again a multidimensional array, implemented by Numpy. The
package contains tools that help with solving linear algebra, probability
theory, integral calculus and many more tasks.
Pandas (https://pandas.pydata.org/)
• Pandas provides high-level data structures and a vast variety of tools for analysis. The
great feature of this package is the ability to translate rather complex operations with
data into one or two commands. Pandas contains many built-in methods for grouping,
filtering, and combining data, as well as the time-series functionality.
Visualization
Matplotlib (https://matplotlib.org/index.html)
• Matplotlib is a low-level library for creating two-dimensional diagrams and graphs. With
its help, you can build diverse charts, from histograms and scatterplots to non-Cartesian
coordinates graphs. Moreover, many popular plotting libraries are designed to work in
conjunction with matplotlib.
Seaborn (https://seaborn.pydata.org/)
• Seaborn is essentially a higher-level API based on the matplotlib library. It contains more
suitable default settings for processing charts. Also, there is a rich gallery of visualizations
including some complex types like time series, jointplots, and violin diagrams.
Plotly (https://plot.ly/python/)
• Plotly is a popular library that allows you to build sophisticated graphics easily. The
package is adapted to work in interactive web applications. Among its remarkable
visualizations are contour graphics, ternary plots, and 3D charts.
Bokeh (https://bokeh.pydata.org/en/latest/)
• The Bokeh library creates interactive and scalable visualizations in a browser using
JavaScript widgets. The library provides a versatile collection of graphs, styling
possibilities, interaction abilities in the form of linking plots, adding widgets, and defining
callbacks, and many more useful features.
Data Mining & Machine Learning
Scikit-learn (https://scikit-learn.org/stable/)
• This Python module based on NumPy and SciPy is one of the best libraries for
working with data. It provides algorithms for many standard machine learning and
data mining tasks such as clustering, regression, classification, dimensionality
reduction, and model selection.
PyFim (http://www.borgelt.net/pyfim.html)
• PyFIM is an extension module that makes several frequent item set mining
implementations available as functions. Currently apriori, eclat, fpgrowth, sam, relim,
carpenter, ista, accretion and apriacc are available as functions, although the
interfaces do not offer all of the options of the command line program.
Eli5 (https://eli5.readthedocs.io/en/latest/)
• Often the results of machine learning models predictions are not entirely clear, and
this is the challenge that eli5 library helps to deal with. It is a package for visualization
and debugging machine learning models and tracking the work of an algorithm step
by step. It provides support for scikit-learn, XGBoost, LightGBM, lightning, and
sklearn-crfsuite libraries and performs the different tasks for each of them.
Deep Learning
TensorFlow (https://www.tensorflow.org/)
• TensorFlow is a popular framework for deep and machine learning, developed in Google Brain.
It provides abilities to work with artificial neural networks with multiple data sets. Among the
most popular TensorFlow applications are object identification, speech recognition, and more.
PyTorch (https://pytorch.org/)
• PyTorch is a large framework that allows you to perform tensor computations with GPU acceleration, create
dynamic computational graphs and automatically calculate gradients. Above this, PyTorch offers a rich API
for solving applications related to neural networks.

Keras (https://keras.io/)
• Keras is a high-level library for working with neural networks, running on top of TensorFlow,
Theano, and now as a result of the new releases. It simplifies many specific tasks and greatly
reduces the amount of monotonous code. However, it may not be suitable for some
complicated things.
Dist-keras (https://joerihermans.com/work/distributed-keras/)
• dist-keras and others are gaining popularity and developing rapidly, and it is very difficult to single out
one of the libraries since they are all designed to solve a common task. These packages allow you to
train neural networks based on the Keras library directly with the help of Apache Spark.
Natural Language Processing & Data Scraping
NLTK (https://www.nltk.org/)
• NLTK is a set of libraries, a whole platform for natural language processing.
With the help of NLTK, you can process and analyze text in a variety of ways,
tokenize and tag it, extract information, etc. NLTK is also used for
prototyping and building research systems.
Gensim (https://radimrehurek.com/gensim/)
• Gensim is a Python library for robust semantic analysis, topic modeling and
vector-space modeling, and is built upon Numpy and Scipy. It provides an
implementation of popular NLP algorithms, such as word2vec. Although
gensim has its own models.wrappers.fasttext implementation, the fasttext
library can also be used for efficient learning of word representations.
Scrapy (https://scrapy.org/)
• Scrapy is a library used to create spiders bots that scan website pages and
collect structured data. In addition, Scrapy can extract data from the API. The
library happens to be very handy due to its extensibility and portability.
Thank you
https://www.kdnuggets.com/2018/06/top-20-python-libraries-data-science-
2018.html

Python Libraries for Developers
No ratings yet
Python Libraries for Developers
2 pages
Python Libraries
No ratings yet
Python Libraries
9 pages
Data Science
No ratings yet
Data Science
17 pages
Lecture 4
No ratings yet
Lecture 4
33 pages
Python Libraries
No ratings yet
Python Libraries
12 pages
Casestudy ML
No ratings yet
Casestudy ML
4 pages
Chapter-5 DS
No ratings yet
Chapter-5 DS
2 pages
Pai 6
No ratings yet
Pai 6
17 pages
Py Libs
No ratings yet
Py Libs
8 pages
MySQL Backup & Recovery Basics
No ratings yet
MySQL Backup & Recovery Basics
15 pages
Staple Python Libraries For Data Science
No ratings yet
Staple Python Libraries For Data Science
26 pages
Python Libraries For ML
No ratings yet
Python Libraries For ML
2 pages
Basic Libraries For Data Science
No ratings yet
Basic Libraries For Data Science
4 pages
Expt-1 Dav
No ratings yet
Expt-1 Dav
5 pages
ML Lab File
No ratings yet
ML Lab File
33 pages
Chapter 6 Python Libraries For Machine Learning
No ratings yet
Chapter 6 Python Libraries For Machine Learning
21 pages
10 Essential Python Libraries For Data Professionals - by Sigli Mumuni - Medium
No ratings yet
10 Essential Python Libraries For Data Professionals - by Sigli Mumuni - Medium
6 pages
Python-Libraries SEMINAR
No ratings yet
Python-Libraries SEMINAR
12 pages
Data Analysis Library: by Muthu Priya J 19MZ06
No ratings yet
Data Analysis Library: by Muthu Priya J 19MZ06
3 pages
PDF 1675791423
No ratings yet
PDF 1675791423
11 pages
Python Libs For Ds
No ratings yet
Python Libs For Ds
5 pages
Top 18 Python Libraries for Data Science
100% (1)
Top 18 Python Libraries for Data Science
11 pages
Top 20 Trending Python Libraries
No ratings yet
Top 20 Trending Python Libraries
15 pages
Essential Python Libraries For Data Science 1694045951
No ratings yet
Essential Python Libraries For Data Science 1694045951
7 pages
Machine Learning Python Packages
No ratings yet
Machine Learning Python Packages
9 pages
Libraries For Data Science
No ratings yet
Libraries For Data Science
2 pages
Python Libraries
No ratings yet
Python Libraries
17 pages
Practical 1
No ratings yet
Practical 1
8 pages
In Python, A Library Is A Collection of Pre-Writt...
No ratings yet
In Python, A Library Is A Collection of Pre-Writt...
3 pages
Pre ML Practise
No ratings yet
Pre ML Practise
14 pages
An Overview and Comparison of Free Python Libraries For Data Mining and Big Data Analysis
No ratings yet
An Overview and Comparison of Free Python Libraries For Data Mining and Big Data Analysis
6 pages
Python Libraries For Data Science
No ratings yet
Python Libraries For Data Science
6 pages
The Most Popular Python Libraries
No ratings yet
The Most Popular Python Libraries
7 pages
Top 20 Python Libraries For Data Science
No ratings yet
Top 20 Python Libraries For Data Science
15 pages
Python For Data Analysis The Python Crash Course Comprehensive The Programming From The Ground Up To Python by Cannon, Jason
No ratings yet
Python For Data Analysis The Python Crash Course Comprehensive The Programming From The Ground Up To Python by Cannon, Jason
167 pages
Numpy: Explanation
No ratings yet
Numpy: Explanation
21 pages
DDI Book Chapter Tools and Techniques
No ratings yet
DDI Book Chapter Tools and Techniques
13 pages
Dsbda Unit4
No ratings yet
Dsbda Unit4
110 pages
40 Most Popular Python Scientific Libraries
No ratings yet
40 Most Popular Python Scientific Libraries
9 pages
Python For Data Analysis
No ratings yet
Python For Data Analysis
49 pages
15 Python Libraries For Data Science
No ratings yet
15 Python Libraries For Data Science
17 pages
Core Libraries For Machine Learning
No ratings yet
Core Libraries For Machine Learning
5 pages
Python Libraries For Data Science
No ratings yet
Python Libraries For Data Science
10 pages
iGCSE Biology Study Guide
100% (1)
iGCSE Biology Study Guide
4 pages
Data Visualization
No ratings yet
Data Visualization
25 pages
Machine Learning Document
No ratings yet
Machine Learning Document
7 pages
Done Assignment
No ratings yet
Done Assignment
9 pages
PYTHON
No ratings yet
PYTHON
11 pages
GS 150
No ratings yet
GS 150
72 pages
Introduction To Popular-1
No ratings yet
Introduction To Popular-1
15 pages
Python Libraries For Data Science
No ratings yet
Python Libraries For Data Science
10 pages
DL Exp1
No ratings yet
DL Exp1
8 pages
Selling Task % Weight of Task in Sales Process % Advertising Contribution To Task Advertising's Contribution To Sales Estimated Estimated Projected
100% (1)
Selling Task % Weight of Task in Sales Process % Advertising Contribution To Task Advertising's Contribution To Sales Estimated Estimated Projected
2 pages
Important Libraries For Data Science
No ratings yet
Important Libraries For Data Science
29 pages
Canon Irc2380i Irc3080 Irc3080i Irc3580 Irc3580i Brochure
No ratings yet
Canon Irc2380i Irc3080 Irc3080i Irc3580 Irc3580i Brochure
8 pages
100 Must-Know PythonMl Interview Questions and Answers 2024 - Devinterview - Io
No ratings yet
100 Must-Know PythonMl Interview Questions and Answers 2024 - Devinterview - Io
1 page
Data Ty
No ratings yet
Data Ty
59 pages
Personal Dynamics Part A
No ratings yet
Personal Dynamics Part A
20 pages
Minnesota Waterfowl Regulations 2023
No ratings yet
Minnesota Waterfowl Regulations 2023
32 pages
Essential Python Libraries and Functions For Data Science 1706295212
No ratings yet
Essential Python Libraries and Functions For Data Science 1706295212
12 pages
COE301 Lab 11 Datapath Component Design
No ratings yet
COE301 Lab 11 Datapath Component Design
7 pages
TY FDS Workbook
No ratings yet
TY FDS Workbook
56 pages
Data Preprocessing-AIML Algorithm1
No ratings yet
Data Preprocessing-AIML Algorithm1
47 pages
Python Essentials for Data Science
No ratings yet
Python Essentials for Data Science
8 pages
Medan LPG Terminal Overview
100% (1)
Medan LPG Terminal Overview
38 pages
2022 Article 3361
No ratings yet
2022 Article 3361
18 pages
Tendrel Nyesel - Rigpa Wiki052150
No ratings yet
Tendrel Nyesel - Rigpa Wiki052150
6 pages
EWD Camry 2006
No ratings yet
EWD Camry 2006
400 pages
Ass1 DSBDA Writeup
No ratings yet
Ass1 DSBDA Writeup
8 pages
Sec-D ML Practical File PDF
No ratings yet
Sec-D ML Practical File PDF
19 pages
Reoi Construction Supervision Services Leseru-Kitale Morpus-Lokichar - 28.3.2025
100% (1)
Reoi Construction Supervision Services Leseru-Kitale Morpus-Lokichar - 28.3.2025
3 pages
Grade 9 Chapter 10 Review Exercise
No ratings yet
Grade 9 Chapter 10 Review Exercise
6 pages
Government Arts College Salem-7
No ratings yet
Government Arts College Salem-7
2 pages
The Genius Guide To - Divine Archetypes
100% (1)
The Genius Guide To - Divine Archetypes
18 pages
MA6452 S&NM 1 - by Civildatas - Com 12
No ratings yet
MA6452 S&NM 1 - by Civildatas - Com 12
50 pages
Ethiopian Construction Claims Study
100% (1)
Ethiopian Construction Claims Study
128 pages
AR-M208 Service Manual
No ratings yet
AR-M208 Service Manual
32 pages
Cleaning Validation MACO Swab Rinse Ovais v1.1
No ratings yet
Cleaning Validation MACO Swab Rinse Ovais v1.1
8 pages
PGECET College Lsit 2023 Me Mtech
No ratings yet
PGECET College Lsit 2023 Me Mtech
24 pages
Mysterious Loan Request at Bank
No ratings yet
Mysterious Loan Request at Bank
28 pages
Images Line Drawings and Backplanes
No ratings yet
Images Line Drawings and Backplanes
27 pages
CCC Professional Cloud Security Manager
No ratings yet
CCC Professional Cloud Security Manager
32 pages
Vipin Kumar Resume
No ratings yet
Vipin Kumar Resume
1 page
Ross Girshick Et Al - in 2013 Proposed An Architecture Called R-CNN (Region
No ratings yet
Ross Girshick Et Al - in 2013 Proposed An Architecture Called R-CNN (Region
6 pages
Pega CSSA Cheat Sheet For OOTB Rules
No ratings yet
Pega CSSA Cheat Sheet For OOTB Rules
4 pages
Falke Talk - The Falke 80 - 90 Serial No Database - 03
No ratings yet
Falke Talk - The Falke 80 - 90 Serial No Database - 03
5 pages
Final Program - LSB Pinning Ceremony 2024
No ratings yet
Final Program - LSB Pinning Ceremony 2024
4 pages
2006-12-31: Overall Conclusion For The Year of 'Arise and Shine'
No ratings yet
2006-12-31: Overall Conclusion For The Year of 'Arise and Shine'
6 pages

00 Dm2 Python Libraries4data Science 2020

Uploaded by

00 Dm2 Python Libraries4data Science 2020

Uploaded by

Top 20 Python Libraries for

You might also like