Adjusted R-Squared
R-squared measures the proportion of the variation in your dependent variable (Y) explained by your independent variables (X) for a linear regression model. Adjusted R-squared adjusts the statistic based on the number of independent variables in the model.R2R2 shows how well terms (data points) fit a curve or line. Adjusted R2R2 also indicates how well terms fit a curve or line, but adjusts for the number of terms in a model. If you add more and more useless variables to a model, adjusted r-squared will decrease. If you add more useful variables, adjusted r-squared will increase.
Adjusted R2adjRadj2 will always be less than or equal to R2R2. You only need R2R2 when working with samples. In other words, R2R2 isn't necessary when you have data from an entire population.
R2adj=1−[(1−R2)(n−1)n−k−1]Radj2=1−[(1−R2)(n−1)n−k−1]
Where −
· nn = the number of points in your data sample.
· kk = the number of independent regressors, i.e. the number of variables in your model, excluding the constant.
Problem Statement:
A fund has a sample R-squared value close to 0.5 and it is doubtlessly offering higher risk adjusted returns with the sample size of 50 for 5 predictors. Find Adjusted R square value.
Solution:
Sample size = 50 Number of predictor = 5 Sample R - square = 0.5.Substitute the qualities in the equation,
R2adj=1−[(1−0.52)(50−1)50−5−1]=1−(0.75)×4944,=1−0.8352,=0.1648
Analysis of Variance also termed as ANOVA. It is procedure followed by statisticans to check the potential difference between scale-level dependent variable by a nominal-level variable having two or more categories. It was developed by Ronald Fisher in 1918 and it extends t-test and z-test which compares only nominal level variable to have just two categories.
ANOVAs are majorly of three types:
· One-way ANOVA - One-way ANOVA have only one independent variable and refers to numbers in this variable. For example, to assess differences in IQ by country, you can have 1, 2, and more countries data to compare.
· Two-way ANOVA - Two way ANOVA uses two independent variables. For example, to access differences in IQ by country (variable 1) and gender(variable 2). Here you can examine the interaction between two independent variables. Such Interactions may indicate that differences in IQ is not uniform across a independent variable. For examples females may have higher IQ score over males and have very high score over males in Europe than in America.
Two-way ANOVAs are also termed as factorial ANOVA and can be balanced as well as unbalanced. Balanced refers to having same number of participants in each group where as unbalanced refers to having different number of participants in each group. Following special kind of ANOVAs can be used to handle unbalanced groups.
o Hierarchical approach(Type 1) -If data was not intentionaly unbalanced and has some type of hierarchy between the factors.
o Classical experimental approach(Type 2) - If data was not intentionaly unbalanced and has no hierarchy between the factors.
o Full Regression approach(Type 3) - If data was intentionaly unbalanced because of population.
· N-way or Multivariate ANOVA - N-way ANOVA have multiple independent variables. For example, to assess differences in IQ by country, gender, age etc. simultaneously, N-way ANOVA is to be deployed.
Following are the general steps to carry out ANOVA.
· Setup null and alternative hypothesis where null hypothesis states that there is no significant difference among the groups. And alternative hypothesis assumes that there is a significant difference among the groups.
· Calculate F-ratio and probability of F.
· Compare p-value of the F-ratio with the established alpha or significance level.
· If p-value of F is less than 0.5 then reject the null hypothesis.
· If null hypothesis is rejected, conclude that mean of groups are not equal.