

The further the other values are from the mean the less probable they are. Data values other than the mean will be less probable. this value will have the highest probability). When collecting data, we expect to see this value more than any others when our data is normally distributed (i.e. It is a symmetric distribution where most of the observations cluster around a central peak, which we call the mean. I understand! Trust me, it will make more sense as we explain it and use it.

Whoa! That’s a tightly packed group of mathematical words. Introduction Figure 1.1: An Ideal Normal Distribution, Photo by: MediumĪ normal distribution ( aka a Gaussian distribution) is a continuous probability distribution for real-valued variables.

Data is often characterized by the types of distributions that it contains.

In order to ask the right questions, we need to ask some introductory questions, just like you might do when meeting a new person. It is essential, or at least very helpful, to have a good foundation in statistical principles before diving into this field.ĭata can tell us amazing stories if we ask it the right questions. There is a lot of hype around data science. Published by Teena Mary on SeptemSeptember 1, 2020ĭata is the new oil and new gold. Normal Distribution: An Introductory Guide to PDF and CDF
