/ Python And R Data science skills: 60 csv file read home work 01

Saturday 10 February 2018

60 csv file read home work 01

https://vlrtraining.com/courses/python-data-science-beginner-tutorial 60 csv file read home work 01
In [1]:
import pandas as pd
sal = pd.read_csv('Salaries.csv')
In [2]:
sal = pd.read_csv('Salaries.csv')
In [3]:
#sal
#sal
In [12]:
sal.info()
<class 'pandas.core.frame.DataFrame'>
RangeIndex: 148654 entries, 0 to 148653
Data columns (total 13 columns):
Id                  148654 non-null int64
EmployeeName        148654 non-null object
JobTitle            148654 non-null object
BasePay             148045 non-null float64
OvertimePay         148650 non-null float64
OtherPay            148650 non-null float64
Benefits            112491 non-null float64
TotalPay            148654 non-null float64
TotalPayBenefits    148654 non-null float64
Year                148654 non-null int64
Notes               0 non-null float64
Agency              148654 non-null object
Status              0 non-null float64
dtypes: float64(8), int64(2), object(3)
memory usage: 14.7+ MB
In [16]:
sal["EmployeeName"]
Out[16]:
0                 NATHANIEL FORD
1                   GARY JIMENEZ
2                 ALBERT PARDINI
3              CHRISTOPHER CHONG
4                PATRICK GARDNER
5                 DAVID SULLIVAN
6                      ALSON LEE
7                  DAVID KUSHNER
8                 MICHAEL MORRIS
9             JOANNE HAYES-WHITE
10                 ARTHUR KENNEY
11              PATRICIA JACKSON
12             EDWARD HARRINGTON
13                   JOHN MARTIN
14                DAVID FRANKLIN
15               RICHARD CORRIEA
16                      AMY HART
17                SEBASTIAN WONG
18                    MARTY ROSS
19                 ELLEN MOFFATT
20                    VENUS AZAR
21                  JUDY MELINEK
22                 GEORGE GARCIA
23                 VICTOR WYRSCH
24               JOSEPH DRISCOLL
25                  GREGORY SUHR
26                   JOHN HANLEY
27                RAYMOND GUZMAN
28                DENISE SCHMITT
29                 MONICA FIELDS
                   ...          
148624        Lorraine Rosenthal
148625           Renato C Gurion
148626             Paulet Gaines
148627          Brett A Lundberg
148628            Mark W Mcclure
148629         Elizabeth Iniguez
148630              Randy J Keys
148631           Andre M Johnson
148632    Sharon D Owens-Webster
148633          Edward Ferdinand
148634            David M Turner
148635       James S Kibblewhite
148636             Andrew J Enzi
148637          Kadeshra D Green
148638      Lennard B Hutchinson
148639         Richard A Talbert
148640        Charlene D Mccully
148641     Raphael Marquis Goins
148642         Dominic C Marquez
148643                Kim Brewer
148644              Randy D Winn
148645          Carolyn A Wilson
148646              Not provided
148647            Joann Anderson
148648               Leon Walker
148649             Roy I Tillery
148650              Not provided
148651              Not provided
148652              Not provided
148653                 Joe Lopez
Name: EmployeeName, Length: 148654, dtype: object
In [17]:
sal['BasePay'].mean()
Out[17]:
66325.44884050643
In [18]:
sal['BasePay'].max()
Out[18]:
319275.01000000001
In [19]:
sal['BasePay'].min()
Out[19]:
-166.00999999999999

What is the job title of JOSEPH DRISCOLL ? Note: Use all caps, otherwise you may get an answer that doesn't match up (there is also a lowercase Joseph Driscoll).

In [5]:
sal.head(5)
Out[5]:
Id EmployeeName JobTitle BasePay OvertimePay OtherPay Benefits TotalPay TotalPayBenefits Year Notes Agency Status
0 1 NATHANIEL FORD GENERAL MANAGER-METROPOLITAN TRANSIT AUTHORITY 167411.18 0.00 400184.25 NaN 567595.43 567595.43 2011 NaN San Francisco NaN
1 2 GARY JIMENEZ CAPTAIN III (POLICE DEPARTMENT) 155966.02 245131.88 137811.38 NaN 538909.28 538909.28 2011 NaN San Francisco NaN
2 3 ALBERT PARDINI CAPTAIN III (POLICE DEPARTMENT) 212739.13 106088.18 16452.60 NaN 335279.91 335279.91 2011 NaN San Francisco NaN
3 4 CHRISTOPHER CHONG WIRE ROPE CABLE MAINTENANCE MECHANIC 77916.00 56120.71 198306.90 NaN 332343.61 332343.61 2011 NaN San Francisco NaN
4 5 PATRICK GARDNER DEPUTY CHIEF OF DEPARTMENT,(FIRE DEPARTMENT) 134401.60 9737.00 182234.59 NaN 326373.19 326373.19 2011 NaN San Francisco NaN

No comments:

Post a Comment