Python Pandas - Dataframe Operations
Once we create a data frame, we can do various operations on it.These operations help us in analyzing the data or manipulating the data. First we will create a data frame from a .csv file using read_csv() function as shown below.This data frame will be the basis for our operations.
data:image/s3,"s3://crabby-images/998cc/998cce3e9e64dc2793e2582a38982221b590fd17" alt="data frame basis"
Dataframe columns and rows count
To know the number of rows and columns available in the data frame, we can use shape attribute. It returns a tuple that contains number of rows and columns as:
data:image/s3,"s3://crabby-images/eee06/eee06548fa2f666c6d942f4d0125e825f1e5ca9c" alt="shape attribute"
Suppose, we want to retrieve only rows or columns, we can read that number from the tuple as:
data:image/s3,"s3://crabby-images/375c8/375c80406b65501b8b02125ec7e8d24ed605ff73" alt="retrieving rows columns"
Dataframe get rows
The method head() gives the first 5 rows and the method tail() returns the last 5 rows, as shown below:
data:image/s3,"s3://crabby-images/e3ef2/e3ef2d5056750c25c33109062eeb719c00dde057" alt="retrieving rows from data frame"
data:image/s3,"s3://crabby-images/10eef/10eefdb4f09232a9990d9cceef141b45a0a51a29" alt="retrieving rows from data frame"
To display only the first 2 rows, we can use head() method by passing 2 to it as:
data:image/s3,"s3://crabby-images/0b816/0b816bad05e8bb351bcc6499bb64ab7c747a2fc7" alt="retrieving rows from data frame"
data:image/s3,"s3://crabby-images/f65a8/f65a8e0abdf6a941b12cc4a7e9e901e2cb2dc99f" alt="retrieving rows from data frame"
Dataframe rows in range
We can treat the data frame as an object and retrieve the rows from it using slicing. For example, if we write df[2:5], we can get 2nd row to 4th row (excludes 5th row).
data:image/s3,"s3://crabby-images/79264/79264b67314f28d0153f5d2ffbb71be65808b20e" alt="retrieving range of rows"
Similarly, to display alternate rows, we can use df[0::2] or df[::2] as shown below:
data:image/s3,"s3://crabby-images/04c5a/04c5a44f69fb4e310e627b1ec02ee3d5d3774eb4" alt="retrieving range of rows"
To display the rows in reverse order, we can use negative step size in slicing as:
data:image/s3,"s3://crabby-images/32db7/32db7b090f4d6f5f922b34243e36227757752210" alt="negative step size"
To Retrieve Column Names To retrieve the column names from the data frame, we can use columns attribute as:
data:image/s3,"s3://crabby-images/3cd28/3cd28e570981e9985c3c433550b09093ef06a567" alt="columns attribute"
Dataframe columns data
To get the column data, we can mention the column name as subscript.For example, df.empid will display all employee id numbers.This can also be done using df['empid'] which is shown below:
data:image/s3,"s3://crabby-images/f7551/f7551deac4e88221b12008dd25a3be0de0cc4dfa" alt="data frame empid"