Wednesday, February 27, 2019

Calculating Correlation Values for Categorical Data

To find the correlation values for the fields in our data set, the Pearson Correlation Coefficient (PCC) was used. This requires that the data in both fields be quantitative. But what if we wanted to calculate the correlation between two given fields where one is numerical and the other categorical, or where both are categorical?

The Point Biserial Coefficient (PBC) is a special case of the Pearson Correlation Coefficient. It is usually treated as a separate measure, although the two are mathematically equivalent. It is used when one field has quantitative data and the other has categorical values, specifically categorical data that can only be one of two options, for example gender. To calculate the PBC, the data is divided between the two values of the dichotomous field, which are assigned the values 0 and 1. Splitting the data this way shows the frequency of each value and how the quantitative field is distributed within each group, which indicates how well the two fields are correlated.

Spearman's Rank Order Coefficient is a method of estimating correlation between fields that are ordinal: the values may be categories rather than measurements, but importantly they must be ordered. It checks how well the relationship between the two fields can be described using a monotonic function.

Another method is the Chi-Squared Test. This requires the data to be classified and the frequencies worked out in a table. From this table the Chi-Squared Test assesses the relationship between the two fields. Strictly speaking, it indicates whether the fields are associated rather than how strongly. It works on any pair of nominal or categorical fields.
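As a rough illustration of the three methods described above, here is a minimal sketch in plain Python. The function names and the sample data are made up for this example and are not taken from our data set. In practice a library would normally be used (SciPy, for instance, provides scipy.stats.pointbiserialr, scipy.stats.spearmanr and scipy.stats.chi2_contingency), but writing the calculations out shows that the Point Biserial and Spearman coefficients are both just Pearson applied to recoded data.

```python
from collections import Counter
from math import sqrt


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length numeric sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / sqrt(var_x * var_y)


def point_biserial(groups, values):
    """Code the two categories as 0 and 1, then apply Pearson."""
    first, second = sorted(set(groups))  # fails unless there are exactly two categories
    coded = [0 if g == first else 1 for g in groups]
    return pearson(coded, values)


def average_ranks(values):
    """Rank values from 1 to n; tied values share the mean of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks


def spearman(x, y):
    """Spearman coefficient: Pearson applied to the ranks of the data."""
    return pearson(average_ranks(x), average_ranks(y))


def chi_squared(a, b):
    """Chi-squared statistic and degrees of freedom for two categorical sequences."""
    n = len(a)
    observed = Counter(zip(a, b))            # the frequency table
    row_totals, col_totals = Counter(a), Counter(b)
    stat = 0.0
    for r in row_totals:
        for c in col_totals:
            expected = row_totals[r] * col_totals[c] / n
            stat += (observed[(r, c)] - expected) ** 2 / expected
    dof = (len(row_totals) - 1) * (len(col_totals) - 1)
    return stat, dof


if __name__ == "__main__":
    # Hypothetical sample data, for illustration only.
    gender = ["F", "F", "F", "M", "M", "M"]
    height = [160, 165, 170, 172, 178, 183]
    rating = [1, 2, 2, 3, 4, 5]              # ordinal, with one tie
    answer = ["yes", "yes", "no", "no", "no", "yes"]

    print("point biserial:", point_biserial(gender, height))
    print("spearman:", spearman(rating, height))
    print("chi-squared (statistic, dof):", chi_squared(gender, answer))
```

The chi-squared function returns only the test statistic and the degrees of freedom. Turning these into a p-value needs the chi-squared distribution, which is left to a statistics library here.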
