3/16/23, 2:03 PM BCA II Year B Section Session 02 - 16.03.23 - Jupyter Notebook
In [1]:
import pandas as pd
In [2]:
df = pd.read_csv('IPL_Matches_2008_2022.csv')
In [3]:
df
Out[3]:
     ID       City        Date        Season   MatchNumber  Team1                        Team2                        Venue                                       ...
0    1312200  Ahmedabad   2022-05-29  2022     Final        Rajasthan Royals             Gujarat Titans               Narendra Modi Stadium, Ahmedabad            ...
1    1312199  Ahmedabad   2022-05-27  2022     Qualifier 2  Royal Challengers Bangalore  Rajasthan Royals             Narendra Modi Stadium, Ahmedabad            ...
2    1312198  Kolkata     2022-05-25  2022     Eliminator   Royal Challengers Bangalore  Lucknow Super Giants         Eden Gardens, Kolkata                       ...
3    1312197  Kolkata     2022-05-24  2022     Qualifier 1  Rajasthan Royals             Gujarat Titans               Eden Gardens, Kolkata                       ...
4    1304116  Mumbai      2022-05-22  2022     70           Sunrisers Hyderabad          Punjab Kings                 Wankhede Stadium, Mumbai                    ...
...  ...      ...         ...         ...      ...          ...                          ...                          ...                                         ...
945  335986   Kolkata     2008-04-20  2007/08  4            Kolkata Knight Riders        Deccan Chargers              Eden Gardens                                ...
946  335985   Mumbai      2008-04-20  2007/08  5            Mumbai Indians               Royal Challengers Bangalore  Wankhede Stadium                            ...
947  335984   Delhi       2008-04-19  2007/08  3            Delhi Daredevils             Rajasthan Royals             Feroz Shah Kotla                            ...
948  335983   Chandigarh  2008-04-19  2007/08  2            Kings XI Punjab              Chennai Super Kings          Punjab Cricket Association Stadium, Mohali  ...
949  335982   Bangalore   2008-04-18  2007/08  1            Royal Challengers Bangalore  Kolkata Knight Riders        M Chinnaswamy Stadium                       ...

950 rows × 20 columns
In [4]:
df1 = pd.read_csv('IPL_Ball_by_Ball_2008_2022.csv')
df1
Out[4]:
        ID       innings  overs  ballnumber  batter       bowler          non-striker  extra_type  ...
0       1312200  1        0      1           YBK Jaiswal  Mohammed Shami  JC Buttler   NaN         ...
1       1312200  1        0      2           YBK Jaiswal  Mohammed Shami  JC Buttler   legbyes     ...
2       1312200  1        0      3           JC Buttler   Mohammed Shami  YBK Jaiswal  NaN         ...
3       1312200  1        0      4           YBK Jaiswal  Mohammed Shami  JC Buttler   NaN         ...
4       1312200  1        0      5           YBK Jaiswal  Mohammed Shami  JC Buttler   NaN         ...
...     ...      ...      ...    ...         ...          ...             ...          ...         ...
225949  335982   2        14     5           P Kumar      I Sharma        SB Joshi     legbyes     ...
225950  335982   2        14     6           SB Joshi     I Sharma        P Kumar      NaN         ...
225951  335982   2        14     7           P Kumar      I Sharma        SB Joshi     NaN         ...
225952  335982   2        15     1           SB Joshi     LR Shukla       P Kumar      wides       ...
225953  335982   2        15     2           SB Joshi     LR Shukla       P Kumar      NaN         ...

225954 rows × 17 columns
In [7]:
df.columns
Out[7]:
Index(['ID', 'City', 'Date', 'Season', 'MatchNumber', 'Team1', 'Team2',
'Venue', 'TossWinner', 'TossDecision', 'SuperOver', 'WinningTeam',
'WonBy', 'Margin', 'method', 'Player_of_Match', 'Team1Players',
'Team2Players', 'Umpire1', 'Umpire2'],
dtype='object')
In [8]:
df.isnull()
Out[8]:
     ID     City   Date   Season  MatchNumber  Team1  Team2  Venue  TossWinner  ...
0    False  False  False  False   False        False  False  False  False       ...
1    False  False  False  False   False        False  False  False  False       ...
2    False  False  False  False   False        False  False  False  False       ...
3    False  False  False  False   False        False  False  False  False       ...
4    False  False  False  False   False        False  False  False  False       ...
...  ...    ...    ...    ...     ...          ...    ...    ...    ...         ...
945  False  False  False  False   False        False  False  False  False       ...
946  False  False  False  False   False        False  False  False  False       ...
947  False  False  False  False   False        False  False  False  False       ...
948  False  False  False  False   False        False  False  False  False       ...
949  False  False  False  False   False        False  False  False  False       ...

950 rows × 20 columns
In [9]:
df.isnull().sum()
Out[9]:
ID 0
City 51
Date 0
Season 0
MatchNumber 0
Team1 0
Team2 0
Venue 0
TossWinner 0
TossDecision 0
SuperOver 4
WinningTeam 4
WonBy 0
Margin 18
method 931
Player_of_Match 4
Team1Players 0
Team2Players 0
Umpire1 0
Umpire2 0
dtype: int64
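The per-column counts above can also be read as percentages, which makes it easier to judge whether a column such as `method` is mostly empty. The following is a minimal sketch on a made-up toy frame (the values and the name `toy` are illustrative assumptions, not the IPL data):

```python
import numpy as np
import pandas as pd

# Toy stand-in for the matches table; the values are invented for illustration.
toy = pd.DataFrame({
    "City": ["Mumbai", None, "Delhi", None],
    "WinningTeam": ["A", "B", None, "A"],
    "Margin": [5.0, np.nan, 12.0, 3.0],
})

missing = toy.isnull().sum()              # number of missing cells per column
missing_pct = toy.isnull().mean() * 100   # share of missing cells per column, in percent

summary = pd.DataFrame({"missing": missing, "percent": missing_pct})
print(summary[summary["missing"] > 0])    # show only the columns that have gaps
```

Because `isnull()` returns booleans, `sum()` counts the `True` cells and `mean()` gives their fraction.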
In [10]:
df['WinningTeam'].fillna('NA')  # returns a new Series; df itself is not changed
Out[10]:
0 Gujarat Titans
1 Rajasthan Royals
2 Royal Challengers Bangalore
3 Gujarat Titans
4 Punjab Kings
...
945 Kolkata Knight Riders
946 Royal Challengers Bangalore
947 Delhi Daredevils
948 Chennai Super Kings
949 Kolkata Knight Riders
Name: WinningTeam, Length: 950, dtype: object
In [11]:
df['WinningTeam'] = df['WinningTeam'].fillna('NA')
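Cells In [10] and In [11] differ only in whether the result of `fillna` is assigned back. A small sketch on a toy frame (the name `toy` and its values are assumptions for illustration) shows why the assignment is needed:

```python
import pandas as pd

toy = pd.DataFrame({"WinningTeam": ["A", None, "B"]})

filled = toy["WinningTeam"].fillna("NA")           # a new Series with the gap filled
before = int(toy["WinningTeam"].isnull().sum())    # the original column still has the gap

toy["WinningTeam"] = filled                        # assigning back updates the frame
after = int(toy["WinningTeam"].isnull().sum())

print(before, after)
```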
In [12]:
df.isnull().sum()
Out[12]:
ID 0
City 51
Date 0
Season 0
MatchNumber 0
Team1 0
Team2 0
Venue 0
TossWinner 0
TossDecision 0
SuperOver 4
WinningTeam 0
WonBy 0
Margin 18
method 931
Player_of_Match 4
Team1Players 0
Team2Players 0
Umpire1 0
Umpire2 0
dtype: int64
In [15]:
df[df['SuperOver'].isnull()].index
Out[15]:
Int64Index([205, 437, 464, 708], dtype='int64')
In [16]:
df = df.drop(df[df['SuperOver'].isnull()].index)
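Dropping rows by looking up their index labels works, and `dropna` with a `subset` is a shorter way to express the same step. The sketch below compares the two on a toy frame (the name `toy` and its values are assumptions for illustration):

```python
import pandas as pd

toy = pd.DataFrame({
    "ID": [1, 2, 3, 4],
    "SuperOver": ["N", None, "Y", None],
})

# Approach used in the notebook: find the index labels of the gaps, then drop them.
by_index = toy.drop(toy[toy["SuperOver"].isnull()].index)

# Shorter alternative: drop every row where the chosen column is missing.
by_dropna = toy.dropna(subset=["SuperOver"])

print(by_index.equals(by_dropna))

# Either way the surviving rows keep their old labels; reset_index renumbers them.
renumbered = by_dropna.reset_index(drop=True)
print(renumbered)
```

The old labels are kept after a drop, which is why a frame can have fewer rows than its largest index label suggests.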
In [17]:
df
Out[17]:
...  Team2                        Venue                                       TossWinner                   TossDecision  SuperOver  WinningTeam                  WonBy    Margin
...  Gujarat Titans               Narendra Modi Stadium, Ahmedabad            Rajasthan Royals             bat           N          Gujarat Titans               Wickets  7.0
...  Rajasthan Royals             Narendra Modi Stadium, Ahmedabad            Rajasthan Royals             field         N          Rajasthan Royals             Wickets  7.0
...  Lucknow Super Giants         Eden Gardens, Kolkata                       Lucknow Super Giants         field         N          Royal Challengers Bangalore  Runs     14.0
...  Gujarat Titans               Eden Gardens, Kolkata                       Gujarat Titans               field         N          Gujarat Titans               Wickets  7.0
...  Punjab Kings                 Wankhede Stadium, Mumbai                    Sunrisers Hyderabad          bat           N          Punjab Kings                 Wickets  5.0
...  ...                          ...                                         ...                          ...           ...        ...                          ...      ...
...  Deccan Chargers              Eden Gardens                                Deccan Chargers              bat           N          Kolkata Knight Riders        Wickets  5.0
...  Royal Challengers Bangalore  Wankhede Stadium                            Mumbai Indians               bat           N          Royal Challengers Bangalore  Wickets  5.0
...  Rajasthan Royals             Feroz Shah Kotla                            Rajasthan Royals             bat           N          Delhi Daredevils             Wickets  9.0
...  Chennai Super Kings          Punjab Cricket Association Stadium, Mohali  Chennai Super Kings          bat           N          Chennai Super Kings          Runs     33.0
...  Kolkata Knight Riders        M Chinnaswamy Stadium                       Royal Challengers Bangalore  field         N          Kolkata Knight Riders        Runs     140.0
In [18]:
df.isnull().sum()
Out[18]:
ID 0
City 51
Date 0
Season 0
MatchNumber 0
Team1 0
Team2 0
Venue 0
TossWinner 0
TossDecision 0
SuperOver 0
WinningTeam 0
WonBy 0
Margin 14
method 927
Player_of_Match 0
Team1Players 0
Team2Players 0
Umpire1 0
Umpire2 0
dtype: int64
In [19]:
df['WinningTeam'].value_counts()
Out[19]:
Mumbai Indians 131
Chennai Super Kings 121
Kolkata Knight Riders 114
Royal Challengers Bangalore 109
Rajasthan Royals 96
Kings XI Punjab 88
Sunrisers Hyderabad 75
Delhi Daredevils 67
Delhi Capitals 36
Deccan Chargers 29
Gujarat Lions 13
Punjab Kings 13
Pune Warriors 12
Gujarat Titans 12
Rising Pune Supergiant 10
Lucknow Super Giants 9
Kochi Tuskers Kerala 6
Rising Pune Supergiants 5
Name: WinningTeam, dtype: int64
In [20]:
df['WinningTeam']=df['WinningTeam'].replace(['Kings XI Punjab', 'Deccan Chargers','D…
    ['Punjab Kings' ,'Sunrisers Hyderabad', 'Delhi Capitals','Gujarat Titans','Rising Pu…
In [21]:
df['WinningTeam'].value_counts()
Out[21]:
Mumbai Indians 131
Chennai Super Kings 121
Kolkata Knight Riders 114
Royal Challengers Bangalore 109
Sunrisers Hyderabad 104
Delhi Capitals 103
Punjab Kings 101
Rajasthan Royals 96
Rising Pune Supergiants 27
Gujarat Titans 25
Lucknow Super Giants 9
Kochi Tuskers Kerala 6
Name: WinningTeam, dtype: int64
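The `replace` call in In [20] is cut off in this printout, so its full argument lists are not visible. As a sketch only, the dictionary below is an assumed mapping inferred from the win counts in Out[19] and Out[21] (for example 88 + 13 = 101 for Punjab Kings); it is not taken from the original cell. A dictionary keeps each old name next to its new name, which is harder to misalign than two parallel lists.

```python
import pandas as pd

# Assumed mapping, inferred from the counts before and after the replace call.
rename_map = {
    "Kings XI Punjab": "Punjab Kings",
    "Deccan Chargers": "Sunrisers Hyderabad",
    "Delhi Daredevils": "Delhi Capitals",
    "Gujarat Lions": "Gujarat Titans",
    "Pune Warriors": "Rising Pune Supergiants",
    "Rising Pune Supergiant": "Rising Pune Supergiants",
}

# Win counts copied from Out[19] for the teams touched by the mapping.
before = pd.Series({
    "Kings XI Punjab": 88, "Punjab Kings": 13,
    "Deccan Chargers": 29, "Sunrisers Hyderabad": 75,
    "Delhi Daredevils": 67, "Delhi Capitals": 36,
    "Gujarat Lions": 13, "Gujarat Titans": 12,
    "Pune Warriors": 12, "Rising Pune Supergiant": 10,
    "Rising Pune Supergiants": 5,
})

# Relabel the old names, then add up the counts that now share a label.
after = before.rename(index=rename_map).groupby(level=0).sum()
print(after.sort_values(ascending=False))

# The same dictionary can be passed to Series.replace on a column of team names.
teams = pd.Series(["Kings XI Punjab", "Punjab Kings", "Pune Warriors"])
print(teams.replace(rename_map).tolist())
```

With this mapping the merged totals agree with Out[21], which suggests it is at least consistent with what the truncated cell did.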
In [22]:
df.columns
Out[22]:
Index(['ID', 'City', 'Date', 'Season', 'MatchNumber', 'Team1', 'Team2',
'Venue', 'TossWinner', 'TossDecision', 'SuperOver', 'WinningTeam',
'WonBy', 'Margin', 'method', 'Player_of_Match', 'Team1Players',
'Team2Players', 'Umpire1', 'Umpire2'],
dtype='object')
In [23]:
import seaborn as sns
sns.countplot(data=df, y='WinningTeam')
C:\ProgramData\Anaconda3\lib\site-packages\scipy\__init__.py:146: UserWarning: A NumPy version >=1.16.5 and <1.23.0 is required for this version of SciPy (detected version 1.24.1
  warnings.warn(f"A NumPy version >={np_minversion} and <{np_maxversion}"
Out[23]:
<AxesSubplot:xlabel='count', ylabel='WinningTeam'>
In [24]:
df['WinningTeam'].value_counts().index
Out[24]:
Index(['Mumbai Indians', 'Chennai Super Kings', 'Kolkata Knight Riders',
'Royal Challengers Bangalore', 'Sunrisers Hyderabad', 'Delhi Capitals',
'Punjab Kings', 'Rajasthan Royals', 'Rising Pune Supergiants',
'Gujarat Titans', 'Lucknow Super Giants', 'Kochi Tuskers Kerala'],
dtype='object')
In [25]:
sns.countplot(data=df, y='WinningTeam',
              order=df['WinningTeam'].value_counts().index)
Out[25]:
<AxesSubplot:xlabel='count', ylabel='WinningTeam'>
In [26]:
sns.countplot(data=df, y='Player_of_Match',
              order=df['Player_of_Match'].value_counts().index)
Out[26]:
<AxesSubplot:xlabel='count', ylabel='Player_of_Match'>
In [29]:
df['Player_of_Match'].value_counts().head(10).index
Out[29]:
Index(['AB de Villiers', 'CH Gayle', 'DA Warner', 'RG Sharma', 'MS Dhoni',
'YK Pathan', 'SR Watson', 'KA Pollard', 'SK Raina', 'V Kohli'],
dtype='object')
In [30]:
sns.countplot(data=df, y='Player_of_Match',
              order=df['Player_of_Match'].value_counts().head(10).index)
Out[30]:
<AxesSubplot:xlabel='count', ylabel='Player_of_Match'>
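The `order=` argument in the cells above receives a list of category labels, and `value_counts()` already returns them sorted from most to least frequent, so `head(10)` keeps the ten most frequent ones. A minimal sketch of that logic without any plotting, using placeholder player names (all values here are made up for illustration):

```python
import pandas as pd

# Placeholder award winners; not taken from the IPL data.
awards = pd.Series(
    ["Player A", "Player B", "Player A", "Player C", "Player B", "Player A", "Player D"],
    name="Player_of_Match",
)

counts = awards.value_counts()        # sorted from most to fewest awards
top2 = counts.head(2)                 # the first N entries are the top N players
top2_names = top2.index.tolist()      # this list is what `order=` would receive

# Optional: keep only the rows of the top players before counting or plotting.
top_rows = awards[awards.isin(top2_names)]

print(top2)
print(len(top_rows))
```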
In [31]:
import plotly.express as ex
In [32]:
ex.box(data_frame=df, x='Margin')
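A box plot draws the quartiles of a column. In this data `Margin` holds runs for some matches and wickets for others, as the `WonBy` column in Out[17] shows, so a single box mixes two different scales. The sketch below computes the same summary statistics separately per `WonBy` group, using only the ten rows visible in Out[17] (so the numbers describe that small sample, not the full table):

```python
import pandas as pd

# WonBy and Margin values of the ten rows displayed in Out[17].
sample = pd.DataFrame({
    "WonBy": ["Wickets", "Wickets", "Runs", "Wickets", "Wickets",
              "Wickets", "Wickets", "Wickets", "Runs", "Runs"],
    "Margin": [7.0, 7.0, 14.0, 7.0, 5.0, 5.0, 5.0, 9.0, 33.0, 140.0],
})

# describe() reports count, mean, std, min, quartiles and max for each group.
stats = sample.groupby("WonBy")["Margin"].describe()
print(stats[["min", "25%", "50%", "75%", "max"]])
```

With plotly express, passing `color='WonBy'` to `ex.box` is one way to get a separate box per group; that variant is a suggestion and does not appear in the original notebook.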
In [33]:
from random import randint
import numpy as np
In [35]:
for i in range(10):
    print(randint(1, 40))
29
26
1
38
25
15
10
26
2
14
In [36]:
names = ['Sudeep', 'Sanjay', 'Abinesh', 'Sudarsan', 'Ramkumar']
votes = np.random.randint(1000, 10000, (5,))
print(list(zip(names, votes)))
[('Sudeep', 1882), ('Sanjay', 2397), ('Abinesh', 1522), ('Sudarsan', 9710), ('Ramkumar', 3916)]
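The two random-number cells use functions with different bound conventions: `random.randint(a, b)` can return `b`, while `np.random.randint(low, high)` never returns `high`. The sketch below illustrates both and fixes a seed so that a rerun gives the same numbers; the seed value and the use of NumPy's `default_rng` generator are choices made for this example, not part of the original notebook.

```python
import random
import numpy as np

random.seed(0)                                         # fixed seed for repeatable results
rolls = [random.randint(1, 40) for _ in range(1000)]   # both 1 and 40 are possible values

rng = np.random.default_rng(0)                         # NumPy's Generator interface
votes_demo = rng.integers(1000, 10000, size=5)         # 10000 itself is excluded by default

print(min(rolls), max(rolls))
print(votes_demo)
```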
In [38]:
import matplotlib.pyplot as plt
plt.pie(votes, labels=names)
plt.show()
In [42]:
import matplotlib.pyplot as plt
plt.pie(x=votes, labels=names, autopct="%.2f%%", explode=[0, 0, 0, .1, 0],
        colors=['black', 'blue', 'red', 'yellow', 'green'])
plt.show()
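In the pie chart above, `autopct="%.2f%%"` labels each slice with its share of the total, and the hard-coded `explode` list pulls out the fourth slice. A sketch of the underlying arithmetic, using the vote values printed by In [36], with the exploded slice picked programmatically instead of by hand (the variable names `shares`, `pct_labels` and `explode` are introduced here for illustration):

```python
import numpy as np

names = ["Sudeep", "Sanjay", "Abinesh", "Sudarsan", "Ramkumar"]
votes = np.array([1882, 2397, 1522, 9710, 3916])   # values printed by In [36]

shares = votes / votes.sum() * 100                 # percentage of the total per candidate
pct_labels = ["%.2f%%" % s for s in shares]        # same format string as autopct

explode = [0.0] * len(votes)
explode[int(votes.argmax())] = 0.1                 # offset only the largest slice

for name, label in zip(names, pct_labels):
    print(name, label)
```

For these votes the largest slice is the fourth one, so the computed `explode` list matches the hand-written `[0, 0, 0, .1, 0]`; because the votes are random, a fresh run of In [36] could make a different slice the largest.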
In [ ]: