w3resource

Pandas Practice Set-1: Compute a cross-tabulation of two Series in the diamonds DataFrame

Pandas Practice Set-1: Exercise-35 with Solution

Write a Pandas program to compute a cross-tabulation of two Series in the diamonds DataFrame.

Sample Solution:

Python Code:

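import pandas as pd
# Load the diamonds dataset from the seaborn-data repository
diamonds = pd.read_csv('https://raw.githubusercontent.com/mwaskom/seaborn-data/master/diamonds.csv')
print("Original Dataframe:")
print(diamonds.head())
print("\nCross-tabulation of two Series of diamonds DataFrame:")
# Count how often each (cut, price) pair occurs: one row per cut, one column per distinct price
print(pd.crosstab(diamonds['cut'], diamonds['price']))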

Sample Output:

Original Dataframe:
   carat      cut color clarity  depth  table  price     x     y     z
0   0.23    Ideal     E     SI2   61.5   55.0    326  3.95  3.98  2.43
1   0.21  Premium     E     SI1   59.8   61.0    326  3.89  3.84  2.31
2   0.23     Good     E     VS1   56.9   65.0    327  4.05  4.07  2.31
3   0.29  Premium     I     VS2   62.4   58.0    334  4.20  4.23  2.63
4   0.31     Good     J     SI2   63.3   58.0    335  4.34  4.35  2.75

Cross-tabulation of two Series of diamonds DataFrame:
price      326    327    334    335    ...    18804  18806  18818  18823
cut                                    ...                              
Fair           0      0      0      0  ...        0      0      0      0
Good           0      1      0      1  ...        0      0      0      0
Ideal          1      0      0      0  ...        1      1      0      0
Premium        1      0      1      0  ...        0      0      0      1
Very Good      0      0      0      0  ...        0      0      1      0

[5 rows x 11602 columns]
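
Because price is numeric, the table above gets one column per distinct price (11602 of them), which is hard to read. The following is a minimal sketch, not part of the original solution, of two alternatives that usually give a more compact table: cross-tabulating two categorical columns, and grouping price into bands with pd.cut before calling pd.crosstab. To stay runnable offline it rebuilds only the five rows shown in the sample output; the names sample, cut_by_color, price_band and cut_by_band, as well as the bin edges and labels, are assumptions chosen for illustration.

```python
import pandas as pd

# The five rows from the sample output above (only the columns needed here).
sample = pd.DataFrame({
    "cut": ["Ideal", "Premium", "Good", "Premium", "Good"],
    "color": ["E", "E", "E", "I", "J"],
    "price": [326, 326, 327, 334, 335],
})

# Two categorical Series: one row per cut, one column per color.
# margins=True appends an "All" row and column holding the totals.
cut_by_color = pd.crosstab(sample["cut"], sample["color"], margins=True)
print(cut_by_color)

# Group the numeric price into bands first.
# The bin edges and labels are hypothetical, picked for this small sample.
price_band = pd.cut(sample["price"], bins=[320, 330, 340],
                    labels=["321-330", "331-340"])
cut_by_band = pd.crosstab(sample["cut"], price_band)
print(cut_by_band)
```

On the full dataset the same calls would apply to diamonds['cut'], diamonds['color'] and diamonds['price'], with bin edges chosen to suit the full price range.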
