Exploring Data Distribution | Set 2

Terms related to exploring data distribution

-> Boxplot -> Frequency Table -> Histogram -> Density Plot

To get a link to the csv file in use, click here.

Loading Libraries

import numpy as np
import pandas as pd
import seaborn as sns
import matplotlib.pyplot as plt

Loading data

data = pd.read_csv("../data/state.csv")

# Adding new derived data column
data['PopulationInMillions'] = data['Population'] / 1000000

print(data.head(10))

Output:

  • Histogram: a way to visualize the distribution of data through a frequency table, with bins along the X-axis and data counts along the Y-axis.

    Code: histogram

    # Histogram - population in millions
    fig, ax2 = plt.subplots()
    fig.set_size_inches(9, 15)

    # Note: distplot is deprecated since seaborn 0.11; sns.histplot is its replacement
    ax2 = sns.distplot(data.PopulationInMillions, kde=False)
    ax2.set_ylabel("Frequency", fontsize=15)
    ax2.set_xlabel("Population by State in Millions", fontsize=15)
    ax2.set_title("Population - Histogram", fontsize=20)

    Output:

  • Density plot: it is related to the histogram, as it shows the distribution of the data values as a continuous line. It is a smoothed version of the histogram. The output below shows the density curve superimposed on the histogram.

    Code: density plot

    # Density Plot - Population
    fig, ax3 = plt.subplots()
    fig.set_size_inches(7, 9)

    # kde=True overlays the smoothed density curve on the histogram
    ax3 = sns.distplot(data.Population, kde=True)
    ax3.set_ylabel("Density", fontsize=15)
    ax3.set_xlabel("Population by State", fontsize=15)
    ax3.set_title("Density Plot - Population", fontsize=20)

    Output:
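
To make the two terms above more concrete, here is a minimal sketch of what happens behind a histogram and a density plot. It uses only the Python standard library so the arithmetic stays visible. The population values, the helper names frequency_table and gaussian_kde, the bin count and the bandwidth are all assumptions chosen for illustration; they are not taken from state.csv and this is not how seaborn implements its plots internally.

```python
import math

# Hypothetical state populations in millions (illustrative values only,
# not read from state.csv).
populations = [0.6, 0.7, 1.3, 2.9, 4.8, 5.7, 6.5, 9.9, 12.8, 19.4]


def frequency_table(values, n_bins):
    """Count how many values fall into each of n_bins equal-width bins."""
    lo, hi = min(values), max(values)
    width = (hi - lo) / n_bins
    counts = [0] * n_bins
    for v in values:
        # Bins are half-open [left, right); the maximum value is placed
        # in the last bin instead of opening a new one.
        idx = min(int((v - lo) / width), n_bins - 1)
        counts[idx] += 1
    edges = [lo + i * width for i in range(n_bins + 1)]
    return edges, counts


def gaussian_kde(values, bandwidth):
    """Return a density function built as an average of Gaussian bumps,
    one bump centred on each data value."""
    n = len(values)
    norm = 1.0 / (n * bandwidth * math.sqrt(2.0 * math.pi))

    def density(x):
        return norm * sum(
            math.exp(-0.5 * ((x - v) / bandwidth) ** 2) for v in values
        )

    return density


# Frequency table: the numbers a histogram draws as bar heights.
edges, counts = frequency_table(populations, n_bins=4)
for left, right, count in zip(edges, edges[1:], counts):
    print(f"{left:6.2f} to {right:6.2f}: {count}")

# Density estimate: the smooth curve a density plot draws.
density = gaussian_kde(populations, bandwidth=2.0)
print(f"estimated density at 5 million: {density(5.0):.4f}")
```

The bar heights of a histogram are the counts from the frequency table, while the density curve is a smoothed estimate whose area under the curve is 1. A smaller bandwidth follows the data more closely, and a larger one gives a smoother curve.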
