w3resource

Pandas DataFrame: plot.scatter() function

DataFrame.plot.scatter() function

The plot.scatter() function is used to create a scatter plot with varying marker point size and color.

The coordinates of each point are defined by two DataFrame columns, and filled circles are used to represent each point. This kind of plot is useful for seeing complex correlations between two variables. Points could be, for instance, natural 2D coordinates like longitude and latitude on a map or, in general, any pair of metrics that can be plotted against each other.

Syntax:

DataFrame.plot.scatter(x, y, s=None, c=None, **kwargs)

Parameters:

Name | Description | Type / Default Value | Required / Optional
x | The column name or column position to be used as the horizontal coordinate of each point. | int or str | Required
y | The column name or column position to be used as the vertical coordinate of each point. | int or str | Required
s | The size of each point. Possible values are: a single scalar, so all points have the same size; or a sequence of scalars, which are applied to the points in turn (for instance, passing [2, 14] makes the point sizes alternate between 2 and 14). | scalar or array_like, default None | Optional
c | The color of each point. Possible values are: a single color string referred to by name, RGB or RGBA code, for instance 'red' or '#a98d19'; a sequence of color strings referred to by name, RGB or RGBA code, which are applied to the points in turn (for instance, passing ['green', 'yellow'] fills the points alternately in green and yellow); or a column name or position whose values will be used to color the marker points according to a colormap. | str, int or array_like, default None | Optional
**kwargs | Keyword arguments to pass on to DataFrame.plot(). | | Optional
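
The s and c options described above can be sketched as follows. This is a minimal illustration rather than a reference implementation: the cities DataFrame and its values are made up, and the size and color sequences are given one entry per row so the result does not depend on how shorter sequences are handled by the installed matplotlib version.

```python
import matplotlib
matplotlib.use("Agg")  # non-interactive backend so the sketch runs without a display
import pandas as pd

# Hypothetical data: four points given as longitude/latitude pairs.
cities = pd.DataFrame({
    "longitude": [-0.13, 2.35, 13.40, -3.70],
    "latitude": [51.51, 48.86, 52.52, 40.42],
})

# A single scalar size and a single color string: every marker looks the same.
ax_uniform = cities.plot.scatter(x="longitude", y="latitude", s=40, c="red")

# One size and one color per row: each marker gets its own styling.
sizes = [20, 50, 100, 200]
colors = ["green", "yellow", "green", "yellow"]
ax_varied = cities.plot.scatter(x="longitude", y="latitude", s=sizes, c=colors)
```

In a notebook the Agg line can be dropped so the figures render inline.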

Returns: matplotlib.axes.Axes or numpy.ndarray of them
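
Because the return value is a matplotlib Axes, it can be customized further or reused. The sketch below assumes a made-up DataFrame with columns x1, y1, x2 and y2, and passes the returned Axes back through the ax keyword (forwarded to DataFrame.plot()) to draw a second group on the same plot.

```python
import matplotlib
matplotlib.use("Agg")  # non-interactive backend so the sketch runs without a display
import pandas as pd

# Hypothetical data: two groups of points stored in separate column pairs.
df = pd.DataFrame({
    "x1": [1.0, 2.0, 3.0],
    "y1": [2.0, 4.0, 1.0],
    "x2": [1.5, 2.5, 3.5],
    "y2": [3.0, 1.0, 2.0],
})

# The first call creates the Axes and returns it.
ax = df.plot.scatter(x="x1", y="y1", c="darkblue", label="Group 1")

# The second call draws onto the same Axes instead of creating a new figure.
df.plot.scatter(x="x2", y="y2", c="darkgreen", label="Group 2", ax=ax)

# The usual matplotlib Axes methods are available on the result.
ax.set_title("Two groups on one Axes")
ax.set_xlabel("x value")
ax.set_ylabel("y value")
```

Calling ax.figure.savefig() with a file name, or matplotlib.pyplot.show() with an interactive backend, would then output the combined plot.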

Example:
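
A minimal sketch using a small made-up DataFrame; the column names length, width and species are illustrative. The first call draws every point in one color, and the second uses the species column to color the points through a colormap.

```python
import matplotlib
matplotlib.use("Agg")  # non-interactive backend so the sketch runs without a display
import pandas as pd

# Hypothetical measurements: two numeric columns plus an integer category.
df = pd.DataFrame(
    [[5.1, 3.5, 0], [4.9, 3.0, 0], [7.0, 3.2, 1], [6.4, 3.2, 1], [5.9, 3.0, 2]],
    columns=["length", "width", "species"],
)

# Columns "length" and "width" give the coordinates; all points share one color.
ax1 = df.plot.scatter(x="length", y="width", c="darkblue")

# The values of the "species" column pick each point's color from the colormap.
ax2 = df.plot.scatter(x="length", y="width", c="species", colormap="viridis")
```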

