Learn to use pandas.unique() with Series/DataFrame


Python Pandas

Using Pandas unique() with Series/DataFrame

unique() function is used to get the distinct / unique data from Series/DataFrames. It can be applied on the columns in the dataframe.

The unique() method returns a NumPy ndarray of unique values from the Series

In this tutorial we will discuss how to use unique() function with Series and DataFrame in Pandas.

 

Pandas Series.unique - Syntax:

 data.unique()

where, data is Series

 

Pandas DataFrame.unique() - Syntax:

 data['column'].unique()

where, data is the dataframe and column is the name of the column in the dataframe to get the unique values

We will discuss the following scenarios

  • Get unique values from Series
  • Get unique values from particular columns in DataFrame
  • unique() with '.' operator in Series
  • pandas.unique()

 

1. Using Pandas Series.unique()

In this scenario, we can get the unique data from the given series.

Example 1: In this example,  We are creating two series one with integer and another with strings and return the unique values from the first series.

# import pandas
import pandas 

# creating the series -1
data1 = pandas.Series([10,20,30,50,60,10,20,0,45,20])

# get the unique values 
print(data1.unique())

Output:

[10 20 30 50 60  0 45]

 

Example 2: In this example,  We are creating two series one with integer and another with strings and return the unique values from the second series.

# import pandas
import pandas 

# creating the series-2
data2 = pandas.Series(['Python','java','html','php','R','Python','java','html','php','R'])

# get the unique values 
print(data2.unique())

Output:

['Python' 'java' 'html' 'php' 'R']

 

2. Using Pandas DataFrame with unique()

In this case, we have to create the pandas dataframe and get the unique values from the dataframe columns by specifying the column names.

Example 1: In this example, we are going  to get the unique values from Subjects column in the dataframe. So the output will be all unique values from the Subjects column.

# import pandas
import pandas 

# creating the DataFrame 1
data1 = pandas.DataFrame({'Subjects':['java','java','python','html','html','php'],'marks':[80,98,76,87,90,100]})

# get the unique values 
print(data1['Subjects'].unique())

Output:

['java' 'python' 'html' 'php']

 

Example 2: In this example, we are going  to get the unique values from marks column in the dataframe. So the output will be all unique values from the marks column.

# import pandas
import pandas 

# creating the DataFrame 1
data1 = pandas.DataFrame({'Subjects':['java','java','python','html','html','php'],'marks':[80,98,76,87,90,100]})

# get the unique values 
print(data1['marks'].unique())

Output:

[ 80 98 76 87 90 100]

 

3. Using unique() with '.' operator in Series

Here we are going to create a dataframe and get the unique values from different columns. We have to specify dot(.) operator for specifying the column.

Syntax:

dataframe.column.unique()

where,

  1. dataframe is the input dataframe
  2. column is the column name to get unique values from this column.

 

Example 1: In this example, we are going  to get the unique values from Subjects column in the dataframe. So the output will be all unique values from the Subjects column through the "." dot operator.

# import pandas
import pandas 

# creating the DataFrame 1
data1 = pandas.DataFrame({'Subjects':['java','java','python','html','html','php'],'marks':[80,98,76,87,90,100]})

# get the unique values 
print(data1.Subjects.unique())

Output:

['java' 'python' 'html' 'php']

 

Example 2: In this example, we are going  to get the unique values from marks column in the dataframe. So the output will be all unique values from the marks column through the "." dot operator.

# import pandas
import pandas 

# creating the DataFrame 1
data1 = pandas.DataFrame({'Subjects':['java','java','python','html','html','php'],'marks':[80,98,76,87,90,100]})

# get the unique values 
print(data1.marks.unique())

Output:

[ 80 98 76 87 90 100]

 

4. Using pandas.unique()

Return unique values based on a hash table. Uniques are returned in order of appearance. This does NOT sort. Significantly faster than numpy.unique for long enough sequences. Includes NA values.

Syntax:

pandas.unique(dataframe['column'])

where,

  1. dataframe is the input dataframe
  2. column is the column name to get unique values from this column.

 

Example 1: In this example, we are going  to get the unique values from Subjects column in the dataframe.

# import pandas
import pandas 

# creating the DataFrame 1
data1 = pandas.DataFrame({'Subjects':['java','java','python','html','html','php'],'marks':[80,98,76,87,90,100]})

# get the unique values 
print(pandas.unique(dataframe['Subjects']))

Output:

['java' 'python' 'html' 'php']

 

Example 2: In this example, we are going  to get the unique values from marks column in the dataframe.

# import pandas
import pandas 

# creating the DataFrame 1
data1 = pandas.DataFrame({'Subjects':['java','java','python','html','html','php'],'marks':[80,98,76,87,90,100]})

# get the unique values 
print(pandas.unique(dataframe['marks']))

Output:

[ 80 98 76 87 90 100]

 

Summary

In this tutorial we discussed how to get the unique data from pandas Series and DataFrame using unique() function and we also several ways to get the unique data from the dataframe columns. Through the below implementations, we can het the unique values.

  • unique() with '.' operator in Series
  • pandas.unique()

unique() function is applied on machine learning projects/applications to know the duplicate data, with this we will get the accurate results.

 

References

Pandas - unique()

 

Deepak Prasad

Deepak Prasad

He is the founder of GoLinuxCloud and brings over a decade of expertise in Linux, Python, Go, Laravel, DevOps, Kubernetes, Git, Shell scripting, OpenShift, AWS, Networking, and Security. With extensive experience, he excels in various domains, from development to DevOps, Networking, and Security, ensuring robust and efficient solutions for diverse projects. You can connect with him on his LinkedIn profile.

Can't find what you're searching for? Let us assist you.

Enter your query below, and we'll provide instant results tailored to your needs.

If my articles on GoLinuxCloud has helped you, kindly consider buying me a coffee as a token of appreciation.

Buy GoLinuxCloud a Coffee

For any other feedbacks or questions you can send mail to admin@golinuxcloud.com

Thank You for your support!!

Leave a Comment