What does a Q-Q plot tell you?

What does a Q-Q plot tell you?

The Q-Q plot, or quantile-quantile plot, is a graphical tool to help us assess if a set of data plausibly came from some theoretical distribution such as a Normal or exponential. If both sets of quantiles came from the same distribution, we should see the points forming a line that’s roughly straight.

What does a horizontal Q-Q plot mean?

This Q–Q plot compares a sample of data on the vertical axis to a statistical population on the horizontal axis. The points follow a strongly nonlinear pattern, suggesting that the data are not distributed as a standard normal (X ~ N(0,1)).

What happens if Q-Q plot is not normal?

A normal probability plot, or more specifically a quantile-quantile (Q-Q) plot, shows the distribution of the data against the expected normal distribution. If the data is non-normal, the points form a curve that deviates markedly from a straight line.

Should I use P-P plot or QQ plot?

A P-P plot compares the empirical cumulative distribution function of a data set with a specified theoretical cumulative distribution function F(·). A Q-Q plot compares the quantiles of a data distribution with the quantiles of a standardized theoretical distribution from a specified family of distributions.

How do you explain a P-P plot?

In statistics, a P–P plot (probability–probability plot or percent–percent plot or P value plot) is a probability plot for assessing how closely two data sets agree, which plots the two cumulative distribution functions against each other. P-P plots are vastly used to evaluate the skewness of a distribution.

What does the Q stand for in a Q plot?

The Q’s stand for “quantile” and a Q-Q plot. Technically speaking, a Q-Q plot compares the distribution of two sets of data. In most cases, a probability plot will be most useful. A probability plot compares the distribution of a data set with a theoretical distribution.

Which is the slope of a QQ plot?

The slope is the scale and the intercept is the location: The histograms and density estimates for the duration variable in the geyser data set showed that the distribution is far from a normal distribution, and the normal QQ plot shows this as well:

How does a Q-Q plot compare two sets of data?

Technically speaking, a Q-Q plot compares the distribution of two sets of data. In most cases, a probability plot will be most useful. A probability plot compares the distribution of a data set with a theoretical distribution. The R function qqnorm () compares a data set with the theoretical normal distibution.

Which is not normal in a QQ plot?

Let’s take a look at the QQ plot for something that’s not normal: Which generates this plot: Our data (the blue dots) is nowhere close to the red line, meaning it’s not normally distributed (it’s uniformly distributed).