5 ways you can create histogram using pandas DataFrame


Deepak Prasad

Python Pandas

Create histogram with pandas hist() function

By using hist() function, we can create a histogram through pandas. A histogram is a representation of the distribution of data. This function calls <span class="pre">matplotlib.pyplot.hist()</span>, on each series in the DataFrame, resulting in one histogram per column.

 

Syntax:

DataFrame.hist(column=None, by=None, xlabelsize=None, ylabelsize=None, figsize=None, layout=None, bins=n, color=None)

Parameters

  • data - It refers to the input pandas DataFrame
  • column - It refers to the input pandas DataFrame column, where we can get histogram on this column.
  • by - It will refers to the histogram to form the separate groups
  • xlabelsize - It refers to size of histogram on x - axis
  • ylabelsize -It refers to size of histogram on y - axis
  • figsize - It  represents the size of histogram
  • layout - It refers to rows/columns layout.
  • bins - It refers to the number of bins to the histogram needed
  • color - histogram color (default is blue)

 

Different methods to create and customize histogram in Pandas

  • Create Histogram from single column in a dataframe
  • Create Histogram from entire dataframe
  • Create Histogram with specific size
  • Create Histogram with number of bins
  • Create Histogram with specific color

 

Create pandas DataFrame with example data

DataFrame is a data structure used to store the data in two dimensional format. It is similar to table that stores the data in rows and columns. Rows represents the records/ tuples and columns refers to the attributes.

We can create the DataFrame by using pandas.DataFrame() method.

Syntax:

pandas.DataFrame(input_data,columns,index)

Parameters:

It will take mainly three parameters

  1. input_data is represents a list of data
  2. columns represent the columns names for the data
  3. index represent the row numbers/values

We can also create a DataFrame using dictionary by skipping columns and indices.

Example: Python Program to create a dataframe for market data from a dictionary of food items by specifying the column names.

# import pandas 
import pandas 

# create dataframe with college data
dataframe = pandas.DataFrame({'Name': {0: 'sravan', 1: 'bobby', 2: 'deepak',3:'prasad'},
				'Subjects': {0: 'PHP', 1: 'PHP', 2: 'dbms',3:'java'},
				'Marks': {0: 89, 1: 90, 2: 93,3:57}})

# display dataframe
print(dataframe)

Output:

     Name Subjects  Marks
0  sravan      PHP     89
1   bobby      PHP     90
2  deepak     dbms     93
3  prasad     java     57

 

Method 1 : Create Histogram from single column in a dataframe

Example 1: In this example, we are creating a histogram from  Age column with out specifying any parameters.

# import pandas 
import pandas 

# create dataframe with college data
dataframe = pandas.DataFrame({'Name': {0: 'sravan', 1: 'bobby', 2: 'deepak',3:'prasad'},
				'Subjects': {0: 'PHP', 1: 'PHP', 2: 'dbms',3:'java'},
				'Marks': {0: 89, 1: 90, 2: 93,3:57},
        'Age':{0:23,1:45,2:32,3:34}})

# create histogram
dataframe.hist('Age')

Output:

5 ways you can create histogram using pandas DataFrame

 

Example 2: In this example, we are creating a histogram from Marks column with out specifying any parameters.

# import pandas 
import pandas 

# create dataframe with college data
dataframe = pandas.DataFrame({'Name': {0: 'sravan', 1: 'bobby', 2: 'deepak',3:'prasad'},
				'Subjects': {0: 'PHP', 1: 'PHP', 2: 'dbms',3:'java'},
				'Marks': {0: 89, 1: 90, 2: 93,3:57},
        'Age':{0:23,1:45,2:32,3:34}})

# create histogram
dataframe.hist('Marks')

Output:

5 ways you can create histogram using pandas DataFrame

 

Method 2 : Create Histogram from entire dataframe

Example: In this example, we are creating a histogram from  the entire dataframe and not specifying any parameters.

# import pandas 
import pandas 

# create dataframe with college data
dataframe = pandas.DataFrame({'Name': {0: 'sravan', 1: 'bobby', 2: 'deepak',3:'prasad'},
				'Subjects': {0: 'PHP', 1: 'PHP', 2: 'dbms',3:'java'},
				'Marks': {0: 89, 1: 90, 2: 93,3:57},
        'Age':{0:23,1:45,2:32,3:34}})

# create histogram
dataframe.hist()

Output:

5 ways you can create histogram using pandas DataFrame

 

Method 3 : Create Histogram with specific size

Example  : In this example, we are creating a histogram from  Age column with x and y labelsize parameters. we are setting 11 for x and 4 for ylabelsize.

# import pandas 
import pandas 

# create dataframe with college data
dataframe = pandas.DataFrame({'Name': {0: 'sravan', 1: 'bobby', 2: 'deepak',3:'prasad'},
				'Subjects': {0: 'PHP', 1: 'PHP', 2: 'dbms',3:'java'},
				'Marks': {0: 89, 1: 90, 2: 93,3:57},
        'Age':{0:23,1:45,2:32,3:34}})

# create histogram
dataframe.hist('Age',xlabelsize=11,ylabelsize=4)

Output:

5 ways you can create histogram using pandas DataFrame

 

Method 4 : Create Histogram with number of bins

Example 1: In this example, we are creating a histogram from  Age column with bins parameter, we are setting 2 as bin value , which refers to the number of bins.

# import pandas 
import pandas 

# create dataframe with college data
dataframe = pandas.DataFrame({'Name': {0: 'sravan', 1: 'bobby', 2: 'deepak',3:'prasad'},
				'Subjects': {0: 'PHP', 1: 'PHP', 2: 'dbms',3:'java'},
				'Marks': {0: 89, 1: 90, 2: 93,3:57},
        'Age':{0:23,1:45,2:32,3:34}})

# create histogram
dataframe.hist('Age',bins=2)

Output:

5 ways you can create histogram using pandas DataFrame

 

Example 2: In this example, we are creating a histogram from  entire dataframe with bins parameter, we are setting 2 as bin value , which refers to the number of bins.

# import pandas 
import pandas 

# create dataframe with college data
dataframe = pandas.DataFrame({'Name': {0: 'sravan', 1: 'bobby', 2: 'deepak',3:'prasad'},
				'Subjects': {0: 'PHP', 1: 'PHP', 2: 'dbms',3:'java'},
				'Marks': {0: 89, 1: 90, 2: 93,3:57},
        'Age':{0:23,1:45,2:32,3:34}})

# create histogram
dataframe.hist(bins=2)

Output:

5 ways you can create histogram using pandas DataFrame

 

Method 5 : Create Histogram with specific color

Example 1: In this example, we are creating a histogram from  the entire dataframe and adding green color to the created histogram

# import pandas 
import pandas 

# create dataframe with college data
dataframe = pandas.DataFrame({'Name': {0: 'sravan', 1: 'bobby', 2: 'deepak',3:'prasad'},
				'Subjects': {0: 'PHP', 1: 'PHP', 2: 'dbms',3:'java'},
				'Marks': {0: 89, 1: 90, 2: 93,3:57},
        'Age':{0:23,1:45,2:32,3:34}})

# create histogram
dataframe.hist(color="green")

Output:

5 ways you can create histogram using pandas DataFrame

 

Example 2: In this example, we are creating a histogram from  the entire dataframe and adding red color to the created histogram

# import pandas 
import pandas 

# create dataframe with college data
dataframe = pandas.DataFrame({'Name': {0: 'sravan', 1: 'bobby', 2: 'deepak',3:'prasad'},
				'Subjects': {0: 'PHP', 1: 'PHP', 2: 'dbms',3:'java'},
				'Marks': {0: 89, 1: 90, 2: 93,3:57},
        'Age':{0:23,1:45,2:32,3:34}})

# create histogram
dataframe.hist(color="red")

Output:

5 ways you can create histogram using pandas DataFrame

 

Some more Examples

Example-1: In this example, we are creating histogram from the entire dataframe by specifying x and y labelsizes as 3 and 5 along with 5 bins and set the histogram color to pink.

# import pandas 
import pandas 

# create dataframe with college data
dataframe = pandas.DataFrame({'Name': {0: 'sravan', 1: 'bobby', 2: 'deepak',3:'prasad'},
				'Subjects': {0: 'PHP', 1: 'PHP', 2: 'dbms',3:'java'},
				'Marks': {0: 89, 1: 90, 2: 93,3:57},
        'Age':{0:23,1:45,2:32,3:34}})

# create histogram
print(dataframe.hist(color="pink",bins=3,xlabelsize=3,ylabelsize=5))


Output:

5 ways you can create histogram using pandas DataFrame

 

Example-2: In this example, we are creating histogram from the entire dataframe by specifying figsize as (7,8) along with 5 bins and set the histogram color to pink.

# import pandas 
import pandas 

# create dataframe with college data
dataframe = pandas.DataFrame({'Name': {0: 'sravan', 1: 'bobby', 2: 'deepak',3:'prasad'},
				'Subjects': {0: 'PHP', 1: 'PHP', 2: 'dbms',3:'java'},
				'Marks': {0: 89, 1: 90, 2: 93,3:57},
        'Age':{0:23,1:45,2:32,3:34}})

# create histogram
print(dataframe.hist(color="pink",bins=3,figsize=(7,8)))


Output:

5 ways you can create histogram using pandas DataFrame

 

Summary

In data visualization , Histogram plays an important role for visualizing the data, And the attraction comes from the histogram itself. In this topic, we discussed how to create a histogram in pandas . we seen all the parameters which are required for the creation of histogram.

To summarise, we learned to create histogram in Pandas with following parameters:

  • With specific size
  • With number of bins
  • Histogram with specific color
  • With figsize

 

Reference

Histogram in Pandas

 

Views: 15

Deepak Prasad

He is the founder of GoLinuxCloud and brings over a decade of expertise in Linux, Python, Go, Laravel, DevOps, Kubernetes, Git, Shell scripting, OpenShift, AWS, Networking, and Security. With extensive experience, he excels in various domains, from development to DevOps, Networking, and Security, ensuring robust and efficient solutions for diverse projects. You can reach out to him on his LinkedIn profile or join on Facebook page.

Can't find what you're searching for? Let us assist you.

Enter your query below, and we'll provide instant results tailored to your needs.

If my articles on GoLinuxCloud has helped you, kindly consider buying me a coffee as a token of appreciation.

Buy GoLinuxCloud a Coffee

For any other feedbacks or questions you can send mail to admin@golinuxcloud.com

Thank You for your support!!

Leave a Comment

GoLinuxCloud Logo


We try to offer easy-to-follow guides and tips on various topics such as Linux, Cloud Computing, Programming Languages, Ethical Hacking and much more.

Programming Languages

JavaScript

Python

Golang

Node.js

Java

Laravel