What is the null hypothesis for the individual p-values in multiple regression?
I have a linear regression model for a dependent variable Y based on two independent variables, X1 and X2, so I have a general form of a regression equation Y = A + B1*X1 + B2*X2 + $epsilon$,
where A is the intercept, e is the error term, and B1 and B2 are the respective coefficients of X1 and X2. I perform a multiple regression with software (statsmodel in Python) and I get coefficients for the model: A = a, B1 = b1, B2 = b2. The model also gives me P values for each coefficient: Pa, P1, and P2. My question is: What is the null hypothesis for those individual P values? I know that the null hypothesis for variable X1 means that the model has a 0 coefficient for B1, but what about the other variables? In other words, If the null hypothesis is Y = A + 0*X1 + B2 * X2, what are the values of A and B2 for the null hypothesis from which the P-value for B1 is derived?
regression p-value
New contributor
add a comment |
I have a linear regression model for a dependent variable Y based on two independent variables, X1 and X2, so I have a general form of a regression equation Y = A + B1*X1 + B2*X2 + $epsilon$,
where A is the intercept, e is the error term, and B1 and B2 are the respective coefficients of X1 and X2. I perform a multiple regression with software (statsmodel in Python) and I get coefficients for the model: A = a, B1 = b1, B2 = b2. The model also gives me P values for each coefficient: Pa, P1, and P2. My question is: What is the null hypothesis for those individual P values? I know that the null hypothesis for variable X1 means that the model has a 0 coefficient for B1, but what about the other variables? In other words, If the null hypothesis is Y = A + 0*X1 + B2 * X2, what are the values of A and B2 for the null hypothesis from which the P-value for B1 is derived?
regression p-value
New contributor
2
Your model is missing an error term.
– Andreas Dzemski
2 hours ago
add a comment |
I have a linear regression model for a dependent variable Y based on two independent variables, X1 and X2, so I have a general form of a regression equation Y = A + B1*X1 + B2*X2 + $epsilon$,
where A is the intercept, e is the error term, and B1 and B2 are the respective coefficients of X1 and X2. I perform a multiple regression with software (statsmodel in Python) and I get coefficients for the model: A = a, B1 = b1, B2 = b2. The model also gives me P values for each coefficient: Pa, P1, and P2. My question is: What is the null hypothesis for those individual P values? I know that the null hypothesis for variable X1 means that the model has a 0 coefficient for B1, but what about the other variables? In other words, If the null hypothesis is Y = A + 0*X1 + B2 * X2, what are the values of A and B2 for the null hypothesis from which the P-value for B1 is derived?
regression p-value
New contributor
I have a linear regression model for a dependent variable Y based on two independent variables, X1 and X2, so I have a general form of a regression equation Y = A + B1*X1 + B2*X2 + $epsilon$,
where A is the intercept, e is the error term, and B1 and B2 are the respective coefficients of X1 and X2. I perform a multiple regression with software (statsmodel in Python) and I get coefficients for the model: A = a, B1 = b1, B2 = b2. The model also gives me P values for each coefficient: Pa, P1, and P2. My question is: What is the null hypothesis for those individual P values? I know that the null hypothesis for variable X1 means that the model has a 0 coefficient for B1, but what about the other variables? In other words, If the null hypothesis is Y = A + 0*X1 + B2 * X2, what are the values of A and B2 for the null hypothesis from which the P-value for B1 is derived?
regression p-value
regression p-value
New contributor
New contributor
edited 1 hour ago
New contributor
asked 2 hours ago
tmldwn
162
162
New contributor
New contributor
2
Your model is missing an error term.
– Andreas Dzemski
2 hours ago
add a comment |
2
Your model is missing an error term.
– Andreas Dzemski
2 hours ago
2
2
Your model is missing an error term.
– Andreas Dzemski
2 hours ago
Your model is missing an error term.
– Andreas Dzemski
2 hours ago
add a comment |
2 Answers
2
active
oldest
votes
The null hypothesis is
$$
H_0: B1 = 0 : text{and} : B2 in mathbb{R} : text{and} : A in mathbb{R},
$$
which basically means that the null hypothesis does not restrict B2 and A.
The alternative hypothesis is
$$
H_1: B1 neq 0 : text{and} : B2 in mathbb{R} : text{and} : A in mathbb{R}.
$$
In a way, the null hypothesis in the multiple regression model is a composite hypothesis. It is "fortunate" that we can construct a pivotal test statistic that does not depend on the true value of B2 and A, so that we do not suffer a penalty from testing a composite null hypothesis.
In other words, there are a lot of different distributions of $(Y, X1, X2)$ that are compatible with the null hypothesis $H_0$. However, all of these distributions lead to the same behavior of the the test statistic that is used to test $H_0$.
In my answer, I have not addressed the distribution of $epsilon$ and implicitly assumed that it is an independent centered normal random variable. If we only assume something like
$$
E[epsilon mid X1, X2] = 0
$$
then a similar conclusion holds asymptotically (under regularity assumptions).
But as I understand it, doesn't the null hypothesis have to be a probability distribution? If I have specific values for the coefficients, I can generate a probability distribution by adding noise (epsilon) to the regression equation. But if I don't have specific values for coefficients, how would I generate the null probability distribution?
– tmldwn
1 hour ago
A composite null hypothesis is a whole set of possible probability measures.
– Andreas Dzemski
1 hour ago
I have edited my answer to emphasize this point.
– Andreas Dzemski
43 mins ago
add a comment |
You can make the same assupmtions for the other variables as the X1. The ANOVA table of the regression gives specific information about each variable significance and the overall significance as well.As far as regression analysis is concerned, the acceptance of null hypothesis implies that the coefficient of the variable is zero, given a certain level of significance.
If you want to acquire a more intuitive aspect of the issue, you can study more about Hypothesis testing.
add a comment |
Your Answer
StackExchange.ifUsing("editor", function () {
return StackExchange.using("mathjaxEditing", function () {
StackExchange.MarkdownEditor.creationCallbacks.add(function (editor, postfix) {
StackExchange.mathjaxEditing.prepareWmdForMathJax(editor, postfix, [["$", "$"], ["\\(","\\)"]]);
});
});
}, "mathjax-editing");
StackExchange.ready(function() {
var channelOptions = {
tags: "".split(" "),
id: "65"
};
initTagRenderer("".split(" "), "".split(" "), channelOptions);
StackExchange.using("externalEditor", function() {
// Have to fire editor after snippets, if snippets enabled
if (StackExchange.settings.snippets.snippetsEnabled) {
StackExchange.using("snippets", function() {
createEditor();
});
}
else {
createEditor();
}
});
function createEditor() {
StackExchange.prepareEditor({
heartbeatType: 'answer',
autoActivateHeartbeat: false,
convertImagesToLinks: false,
noModals: true,
showLowRepImageUploadWarning: true,
reputationToPostImages: null,
bindNavPrevention: true,
postfix: "",
imageUploader: {
brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
allowUrls: true
},
onDemand: true,
discardSelector: ".discard-answer"
,immediatelyShowMarkdownHelp:true
});
}
});
tmldwn is a new contributor. Be nice, and check out our Code of Conduct.
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
StackExchange.ready(
function () {
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fstats.stackexchange.com%2fquestions%2f385005%2fwhat-is-the-null-hypothesis-for-the-individual-p-values-in-multiple-regression%23new-answer', 'question_page');
}
);
Post as a guest
Required, but never shown
2 Answers
2
active
oldest
votes
2 Answers
2
active
oldest
votes
active
oldest
votes
active
oldest
votes
The null hypothesis is
$$
H_0: B1 = 0 : text{and} : B2 in mathbb{R} : text{and} : A in mathbb{R},
$$
which basically means that the null hypothesis does not restrict B2 and A.
The alternative hypothesis is
$$
H_1: B1 neq 0 : text{and} : B2 in mathbb{R} : text{and} : A in mathbb{R}.
$$
In a way, the null hypothesis in the multiple regression model is a composite hypothesis. It is "fortunate" that we can construct a pivotal test statistic that does not depend on the true value of B2 and A, so that we do not suffer a penalty from testing a composite null hypothesis.
In other words, there are a lot of different distributions of $(Y, X1, X2)$ that are compatible with the null hypothesis $H_0$. However, all of these distributions lead to the same behavior of the the test statistic that is used to test $H_0$.
In my answer, I have not addressed the distribution of $epsilon$ and implicitly assumed that it is an independent centered normal random variable. If we only assume something like
$$
E[epsilon mid X1, X2] = 0
$$
then a similar conclusion holds asymptotically (under regularity assumptions).
But as I understand it, doesn't the null hypothesis have to be a probability distribution? If I have specific values for the coefficients, I can generate a probability distribution by adding noise (epsilon) to the regression equation. But if I don't have specific values for coefficients, how would I generate the null probability distribution?
– tmldwn
1 hour ago
A composite null hypothesis is a whole set of possible probability measures.
– Andreas Dzemski
1 hour ago
I have edited my answer to emphasize this point.
– Andreas Dzemski
43 mins ago
add a comment |
The null hypothesis is
$$
H_0: B1 = 0 : text{and} : B2 in mathbb{R} : text{and} : A in mathbb{R},
$$
which basically means that the null hypothesis does not restrict B2 and A.
The alternative hypothesis is
$$
H_1: B1 neq 0 : text{and} : B2 in mathbb{R} : text{and} : A in mathbb{R}.
$$
In a way, the null hypothesis in the multiple regression model is a composite hypothesis. It is "fortunate" that we can construct a pivotal test statistic that does not depend on the true value of B2 and A, so that we do not suffer a penalty from testing a composite null hypothesis.
In other words, there are a lot of different distributions of $(Y, X1, X2)$ that are compatible with the null hypothesis $H_0$. However, all of these distributions lead to the same behavior of the the test statistic that is used to test $H_0$.
In my answer, I have not addressed the distribution of $epsilon$ and implicitly assumed that it is an independent centered normal random variable. If we only assume something like
$$
E[epsilon mid X1, X2] = 0
$$
then a similar conclusion holds asymptotically (under regularity assumptions).
But as I understand it, doesn't the null hypothesis have to be a probability distribution? If I have specific values for the coefficients, I can generate a probability distribution by adding noise (epsilon) to the regression equation. But if I don't have specific values for coefficients, how would I generate the null probability distribution?
– tmldwn
1 hour ago
A composite null hypothesis is a whole set of possible probability measures.
– Andreas Dzemski
1 hour ago
I have edited my answer to emphasize this point.
– Andreas Dzemski
43 mins ago
add a comment |
The null hypothesis is
$$
H_0: B1 = 0 : text{and} : B2 in mathbb{R} : text{and} : A in mathbb{R},
$$
which basically means that the null hypothesis does not restrict B2 and A.
The alternative hypothesis is
$$
H_1: B1 neq 0 : text{and} : B2 in mathbb{R} : text{and} : A in mathbb{R}.
$$
In a way, the null hypothesis in the multiple regression model is a composite hypothesis. It is "fortunate" that we can construct a pivotal test statistic that does not depend on the true value of B2 and A, so that we do not suffer a penalty from testing a composite null hypothesis.
In other words, there are a lot of different distributions of $(Y, X1, X2)$ that are compatible with the null hypothesis $H_0$. However, all of these distributions lead to the same behavior of the the test statistic that is used to test $H_0$.
In my answer, I have not addressed the distribution of $epsilon$ and implicitly assumed that it is an independent centered normal random variable. If we only assume something like
$$
E[epsilon mid X1, X2] = 0
$$
then a similar conclusion holds asymptotically (under regularity assumptions).
The null hypothesis is
$$
H_0: B1 = 0 : text{and} : B2 in mathbb{R} : text{and} : A in mathbb{R},
$$
which basically means that the null hypothesis does not restrict B2 and A.
The alternative hypothesis is
$$
H_1: B1 neq 0 : text{and} : B2 in mathbb{R} : text{and} : A in mathbb{R}.
$$
In a way, the null hypothesis in the multiple regression model is a composite hypothesis. It is "fortunate" that we can construct a pivotal test statistic that does not depend on the true value of B2 and A, so that we do not suffer a penalty from testing a composite null hypothesis.
In other words, there are a lot of different distributions of $(Y, X1, X2)$ that are compatible with the null hypothesis $H_0$. However, all of these distributions lead to the same behavior of the the test statistic that is used to test $H_0$.
In my answer, I have not addressed the distribution of $epsilon$ and implicitly assumed that it is an independent centered normal random variable. If we only assume something like
$$
E[epsilon mid X1, X2] = 0
$$
then a similar conclusion holds asymptotically (under regularity assumptions).
edited 44 mins ago
answered 2 hours ago
Andreas Dzemski
1915
1915
But as I understand it, doesn't the null hypothesis have to be a probability distribution? If I have specific values for the coefficients, I can generate a probability distribution by adding noise (epsilon) to the regression equation. But if I don't have specific values for coefficients, how would I generate the null probability distribution?
– tmldwn
1 hour ago
A composite null hypothesis is a whole set of possible probability measures.
– Andreas Dzemski
1 hour ago
I have edited my answer to emphasize this point.
– Andreas Dzemski
43 mins ago
add a comment |
But as I understand it, doesn't the null hypothesis have to be a probability distribution? If I have specific values for the coefficients, I can generate a probability distribution by adding noise (epsilon) to the regression equation. But if I don't have specific values for coefficients, how would I generate the null probability distribution?
– tmldwn
1 hour ago
A composite null hypothesis is a whole set of possible probability measures.
– Andreas Dzemski
1 hour ago
I have edited my answer to emphasize this point.
– Andreas Dzemski
43 mins ago
But as I understand it, doesn't the null hypothesis have to be a probability distribution? If I have specific values for the coefficients, I can generate a probability distribution by adding noise (epsilon) to the regression equation. But if I don't have specific values for coefficients, how would I generate the null probability distribution?
– tmldwn
1 hour ago
But as I understand it, doesn't the null hypothesis have to be a probability distribution? If I have specific values for the coefficients, I can generate a probability distribution by adding noise (epsilon) to the regression equation. But if I don't have specific values for coefficients, how would I generate the null probability distribution?
– tmldwn
1 hour ago
A composite null hypothesis is a whole set of possible probability measures.
– Andreas Dzemski
1 hour ago
A composite null hypothesis is a whole set of possible probability measures.
– Andreas Dzemski
1 hour ago
I have edited my answer to emphasize this point.
– Andreas Dzemski
43 mins ago
I have edited my answer to emphasize this point.
– Andreas Dzemski
43 mins ago
add a comment |
You can make the same assupmtions for the other variables as the X1. The ANOVA table of the regression gives specific information about each variable significance and the overall significance as well.As far as regression analysis is concerned, the acceptance of null hypothesis implies that the coefficient of the variable is zero, given a certain level of significance.
If you want to acquire a more intuitive aspect of the issue, you can study more about Hypothesis testing.
add a comment |
You can make the same assupmtions for the other variables as the X1. The ANOVA table of the regression gives specific information about each variable significance and the overall significance as well.As far as regression analysis is concerned, the acceptance of null hypothesis implies that the coefficient of the variable is zero, given a certain level of significance.
If you want to acquire a more intuitive aspect of the issue, you can study more about Hypothesis testing.
add a comment |
You can make the same assupmtions for the other variables as the X1. The ANOVA table of the regression gives specific information about each variable significance and the overall significance as well.As far as regression analysis is concerned, the acceptance of null hypothesis implies that the coefficient of the variable is zero, given a certain level of significance.
If you want to acquire a more intuitive aspect of the issue, you can study more about Hypothesis testing.
You can make the same assupmtions for the other variables as the X1. The ANOVA table of the regression gives specific information about each variable significance and the overall significance as well.As far as regression analysis is concerned, the acceptance of null hypothesis implies that the coefficient of the variable is zero, given a certain level of significance.
If you want to acquire a more intuitive aspect of the issue, you can study more about Hypothesis testing.
answered 2 hours ago
Logicseeker
183
183
add a comment |
add a comment |
tmldwn is a new contributor. Be nice, and check out our Code of Conduct.
tmldwn is a new contributor. Be nice, and check out our Code of Conduct.
tmldwn is a new contributor. Be nice, and check out our Code of Conduct.
tmldwn is a new contributor. Be nice, and check out our Code of Conduct.
Thanks for contributing an answer to Cross Validated!
- Please be sure to answer the question. Provide details and share your research!
But avoid …
- Asking for help, clarification, or responding to other answers.
- Making statements based on opinion; back them up with references or personal experience.
Use MathJax to format equations. MathJax reference.
To learn more, see our tips on writing great answers.
Some of your past answers have not been well-received, and you're in danger of being blocked from answering.
Please pay close attention to the following guidance:
- Please be sure to answer the question. Provide details and share your research!
But avoid …
- Asking for help, clarification, or responding to other answers.
- Making statements based on opinion; back them up with references or personal experience.
To learn more, see our tips on writing great answers.
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
StackExchange.ready(
function () {
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fstats.stackexchange.com%2fquestions%2f385005%2fwhat-is-the-null-hypothesis-for-the-individual-p-values-in-multiple-regression%23new-answer', 'question_page');
}
);
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
2
Your model is missing an error term.
– Andreas Dzemski
2 hours ago