101.school
CoursesAbout
Search...⌘K
Generate a course with AI...

    Statistics 1-1

    Receive aemail containing the next unit.
    • Introduction to Statistics
      • 1.1Importance and Applications of statistics
      • 1.2Types of Data
      • 1.3Classification of Statistics
    • Descriptive Statistics
      • 2.1Measures of Central Tendency
      • 2.2Measures of Dispersion
    • Probability
      • 3.1Basic Probability Concepts
      • 3.2Conditional Probability
      • 3.3Theories of Probability
    • Probability Distribution
      • 4.1Probability Mass Function & Probability Density Function
      • 4.2Special Distributions: Binomial, Poisson & Normal Distributions
      • 4.3Central Limit Theorem
    • Sampling and Sampling Methods
      • 5.1Concept of Sampling
      • 5.2Different Sampling Techniques
    • Estimation and Hypothesis Testing
      • 6.1Point and Interval Estimation
      • 6.2Fundamentals of Hypothesis Testing
      • 6.3Type I and II Errors
    • Comparison of Two Populations
      • 7.1Independent Samples
      • 7.2Paired Samples
    • Analysis of Variance (ANOVA)
      • 8.1One-way ANOVA
      • 8.2Two-way ANOVA
    • Regression Analysis
      • 9.1Simple Regression
      • 9.2Multiple Regression
    • Correlation
      • 10.1Concept of Correlation
      • 10.2Types of Correlation
    • Nonparametric Statistics
      • 11.1Chi-Square Test
      • 11.2Mann-Whitney U Test
      • 11.3The Kruskal-Wallis Test
    • Statistical Applications in Quality and Productivity
      • 12.1Use of Statistics in Quality Control
      • 12.2Use of Statistics in Productivity
    • Software Application in Statistics
      • 13.1Introduction to Statistical Software
      • 13.2Statistical Analysis using Software

    Correlation

    Understanding the Concept of Correlation

    plot using the dispersal of scattered dots to show the relationship between variables

    Plot using the dispersal of scattered dots to show the relationship between variables.

    Introduction

    Correlation is a fundamental concept in statistics that measures the degree to which two or more variables move in relation to each other. It is a powerful tool that allows us to quantify the relationship between variables, making it easier to make predictions and draw conclusions.

    Definition of Correlation

    In statistics, correlation is a measure that describes the size and direction of a relationship between two or more variables. A correlation can be positive, meaning both variables move in the same direction, or negative, meaning that when one variable's value increases, the other variables' values decrease. Correlation is always measured between -1 and 1.

    Correlation Coefficient

    The correlation coefficient, often denoted by r, is a numerical measure that quantifies the degree of correlation between two variables. It ranges from -1 to 1, where -1 indicates a perfect negative correlation, 1 indicates a perfect positive correlation, and 0 indicates no correlation.

    The formula for calculating the correlation coefficient between two variables X and Y is:

    r = Σ[(xi - x̄)(yi - ȳ)] / √[Σ(xi - x̄)² * Σ(yi - ȳ)²]

    Where:

    • xi and yi are the individual sample points indexed with i
    • x̄ and ȳ are the mean of X and Y respectively

    Positive and Negative Correlation

    Positive correlation occurs when the values of both variables increase or decrease together. For example, the more hours you study, the higher your exam score tends to be. This is a positive correlation.

    Negative correlation, on the other hand, occurs when the value of one variable increases while the value of the other variable decreases. For example, the more time you spend watching TV, the lower your exam score tends to be. This is a negative correlation.

    Scatter Plots

    Scatter plots are a useful tool for visualizing the correlation between two variables. Each point on the plot corresponds to an observation in the dataset and its position along the X and Y axes represents its values for the two variables. The overall pattern of the distribution of points provides a visual representation of the correlation. A positive correlation is represented by points sloping upwards, and a negative correlation is represented by points sloping downwards.

    In conclusion, understanding the concept of correlation is crucial in many fields, including business, finance, medicine, and more. It allows us to make predictions and understand the relationships between different variables. However, it's important to remember that correlation does not imply causation - just because two variables are correlated, it doesn't mean that one causes the other to occur.

    Test me
    Practical exercise
    Further reading

    Buenos dias, any questions for me?

    Sign in to chat
    Next up: Types of Correlation