Linearity - terminology
$begingroup$
i know that to be qualified as linear, a function must satisfy:
1) $f(u+v)=f(u)+f(v)$
2)$f(lambda u)=lambda f(u)$
and a matrix multiplication $Ax=b$ is considered linear operation because b is a linear combination of the columns of A (any linear transform = matrix).
However, when you think of a system of linear equations (here let's say n vars and n equations) such as this one:
$a_{11}x_{11}+a_{12}x_{12}+...+a_{1n}x_{1n}=y_1$
$... $
$a_{n1}x_{n1}+a_{n2}x_{n2}+...+a_{nn}x_{nn}= y_n$
if we consider the 1D case (only 2 vars), which I prefer, to make things clearer (for my question), to write in terms of $x$'s and $beta$'s, with this time the $beta$'s being the unknown parameters of the model, i.e., the slope and y-intercept in this specific case, (which can be written in matrix form as $Xbeta=y$:
$x_{11}beta_{11}+x_{12}beta_{12}=y_{1}$
$x_{21}beta_{21}+x_{22}beta_{22}=y_{2}$
where $x_{i1} =1$ which means that we could rewrite as:
$beta_{11}+x_{12}beta_{12}=y_{1}$
$beta_{21}+x_{22}beta_{22}=y_{2}$
and therefore we have 2 equations of the form $ y=mx+b$ which if b is non-zero is normally considered affine.
So eventually, my question is about terminology: we talk about systems of linear equations, and indeed matrix multiplication is linear, however we have affine functions (lines with potentially non-zero y-intercepts)... If a kind an very knowledgeable person could clarify this for me, I would be very grateful, as I am quite concerned in such details.
linear-algebra terminology
$endgroup$
add a comment |
$begingroup$
i know that to be qualified as linear, a function must satisfy:
1) $f(u+v)=f(u)+f(v)$
2)$f(lambda u)=lambda f(u)$
and a matrix multiplication $Ax=b$ is considered linear operation because b is a linear combination of the columns of A (any linear transform = matrix).
However, when you think of a system of linear equations (here let's say n vars and n equations) such as this one:
$a_{11}x_{11}+a_{12}x_{12}+...+a_{1n}x_{1n}=y_1$
$... $
$a_{n1}x_{n1}+a_{n2}x_{n2}+...+a_{nn}x_{nn}= y_n$
if we consider the 1D case (only 2 vars), which I prefer, to make things clearer (for my question), to write in terms of $x$'s and $beta$'s, with this time the $beta$'s being the unknown parameters of the model, i.e., the slope and y-intercept in this specific case, (which can be written in matrix form as $Xbeta=y$:
$x_{11}beta_{11}+x_{12}beta_{12}=y_{1}$
$x_{21}beta_{21}+x_{22}beta_{22}=y_{2}$
where $x_{i1} =1$ which means that we could rewrite as:
$beta_{11}+x_{12}beta_{12}=y_{1}$
$beta_{21}+x_{22}beta_{22}=y_{2}$
and therefore we have 2 equations of the form $ y=mx+b$ which if b is non-zero is normally considered affine.
So eventually, my question is about terminology: we talk about systems of linear equations, and indeed matrix multiplication is linear, however we have affine functions (lines with potentially non-zero y-intercepts)... If a kind an very knowledgeable person could clarify this for me, I would be very grateful, as I am quite concerned in such details.
linear-algebra terminology
$endgroup$
1
$begingroup$
You could also just think of the equations as being in the form $$textrm{linear expression}=textrm{value}$$ so the name comes from what is on the left side.
$endgroup$
– MPW
Dec 31 '18 at 13:54
$begingroup$
Ok, and in a sense as we "plug " into the matrix a column of 1's to get our intercept, then this becomes a linear combination of the columns you would say, and in that sense we could accept the definition of "linear"? the intercept is "linearized" so to speak :D
$endgroup$
– Machupicchu
Dec 31 '18 at 13:56
add a comment |
$begingroup$
i know that to be qualified as linear, a function must satisfy:
1) $f(u+v)=f(u)+f(v)$
2)$f(lambda u)=lambda f(u)$
and a matrix multiplication $Ax=b$ is considered linear operation because b is a linear combination of the columns of A (any linear transform = matrix).
However, when you think of a system of linear equations (here let's say n vars and n equations) such as this one:
$a_{11}x_{11}+a_{12}x_{12}+...+a_{1n}x_{1n}=y_1$
$... $
$a_{n1}x_{n1}+a_{n2}x_{n2}+...+a_{nn}x_{nn}= y_n$
if we consider the 1D case (only 2 vars), which I prefer, to make things clearer (for my question), to write in terms of $x$'s and $beta$'s, with this time the $beta$'s being the unknown parameters of the model, i.e., the slope and y-intercept in this specific case, (which can be written in matrix form as $Xbeta=y$:
$x_{11}beta_{11}+x_{12}beta_{12}=y_{1}$
$x_{21}beta_{21}+x_{22}beta_{22}=y_{2}$
where $x_{i1} =1$ which means that we could rewrite as:
$beta_{11}+x_{12}beta_{12}=y_{1}$
$beta_{21}+x_{22}beta_{22}=y_{2}$
and therefore we have 2 equations of the form $ y=mx+b$ which if b is non-zero is normally considered affine.
So eventually, my question is about terminology: we talk about systems of linear equations, and indeed matrix multiplication is linear, however we have affine functions (lines with potentially non-zero y-intercepts)... If a kind an very knowledgeable person could clarify this for me, I would be very grateful, as I am quite concerned in such details.
linear-algebra terminology
$endgroup$
i know that to be qualified as linear, a function must satisfy:
1) $f(u+v)=f(u)+f(v)$
2)$f(lambda u)=lambda f(u)$
and a matrix multiplication $Ax=b$ is considered linear operation because b is a linear combination of the columns of A (any linear transform = matrix).
However, when you think of a system of linear equations (here let's say n vars and n equations) such as this one:
$a_{11}x_{11}+a_{12}x_{12}+...+a_{1n}x_{1n}=y_1$
$... $
$a_{n1}x_{n1}+a_{n2}x_{n2}+...+a_{nn}x_{nn}= y_n$
if we consider the 1D case (only 2 vars), which I prefer, to make things clearer (for my question), to write in terms of $x$'s and $beta$'s, with this time the $beta$'s being the unknown parameters of the model, i.e., the slope and y-intercept in this specific case, (which can be written in matrix form as $Xbeta=y$:
$x_{11}beta_{11}+x_{12}beta_{12}=y_{1}$
$x_{21}beta_{21}+x_{22}beta_{22}=y_{2}$
where $x_{i1} =1$ which means that we could rewrite as:
$beta_{11}+x_{12}beta_{12}=y_{1}$
$beta_{21}+x_{22}beta_{22}=y_{2}$
and therefore we have 2 equations of the form $ y=mx+b$ which if b is non-zero is normally considered affine.
So eventually, my question is about terminology: we talk about systems of linear equations, and indeed matrix multiplication is linear, however we have affine functions (lines with potentially non-zero y-intercepts)... If a kind an very knowledgeable person could clarify this for me, I would be very grateful, as I am quite concerned in such details.
linear-algebra terminology
linear-algebra terminology
asked Dec 31 '18 at 13:43
MachupicchuMachupicchu
279
279
1
$begingroup$
You could also just think of the equations as being in the form $$textrm{linear expression}=textrm{value}$$ so the name comes from what is on the left side.
$endgroup$
– MPW
Dec 31 '18 at 13:54
$begingroup$
Ok, and in a sense as we "plug " into the matrix a column of 1's to get our intercept, then this becomes a linear combination of the columns you would say, and in that sense we could accept the definition of "linear"? the intercept is "linearized" so to speak :D
$endgroup$
– Machupicchu
Dec 31 '18 at 13:56
add a comment |
1
$begingroup$
You could also just think of the equations as being in the form $$textrm{linear expression}=textrm{value}$$ so the name comes from what is on the left side.
$endgroup$
– MPW
Dec 31 '18 at 13:54
$begingroup$
Ok, and in a sense as we "plug " into the matrix a column of 1's to get our intercept, then this becomes a linear combination of the columns you would say, and in that sense we could accept the definition of "linear"? the intercept is "linearized" so to speak :D
$endgroup$
– Machupicchu
Dec 31 '18 at 13:56
1
1
$begingroup$
You could also just think of the equations as being in the form $$textrm{linear expression}=textrm{value}$$ so the name comes from what is on the left side.
$endgroup$
– MPW
Dec 31 '18 at 13:54
$begingroup$
You could also just think of the equations as being in the form $$textrm{linear expression}=textrm{value}$$ so the name comes from what is on the left side.
$endgroup$
– MPW
Dec 31 '18 at 13:54
$begingroup$
Ok, and in a sense as we "plug " into the matrix a column of 1's to get our intercept, then this becomes a linear combination of the columns you would say, and in that sense we could accept the definition of "linear"? the intercept is "linearized" so to speak :D
$endgroup$
– Machupicchu
Dec 31 '18 at 13:56
$begingroup$
Ok, and in a sense as we "plug " into the matrix a column of 1's to get our intercept, then this becomes a linear combination of the columns you would say, and in that sense we could accept the definition of "linear"? the intercept is "linearized" so to speak :D
$endgroup$
– Machupicchu
Dec 31 '18 at 13:56
add a comment |
3 Answers
3
active
oldest
votes
$begingroup$
Your definition of a linear function is correct if you assume the correct definitions of the "+" and scalar multiplication operations you actually left undefined. Maybe it is beneficial
Let $X, Y$ be two vectorspaces over the same field $mathbb{K}$. A function $f: X to Y$ is called linear iff.
for every $x_1, x_2 in X$ we have: $f(x_1 + x_2) = f(x_1) oplus f(x_2)$
where "+" is the addition from the vectorspace $X$ and $oplus$ the one from $Y$.for every $lambda in mathbb{K}$ and $x in X$ we have: $f(lambda cdot x) = lambda odot f(x)$
where $cdot$ is the scalar multiplication from the vectorspace $X$ and $odot$ the one from $Y$.
A system of linear equations can be defined as
- having two vectorspaces $X, Y$ (e.g. $X = Y = mathbb{R}^2$)
- a linear function $f: X to Y$ between them (e.g. represented by a matrix if $X, Y$ are finite vectorspaces, in your post you called it $X$)
- and a "target" vector $y in Y$ (e.g. your $y = (y_1, y_2)^T$)
- and the problem: Seek $x in X$ such that $f(x) = y$.
The word "linear" in "system of linear equations" just refers to $f$ being linear. It does not necessarily refer to the equations representing linear functions, as you correctly saw, they are not even necessarily linear, but only affine. Indeed note that the terminology "system of equations" only makes sense if $X, Y$ are finite vectorspaces, see also the last paragraph of this post.
Let's take your example:
$beta_{11}+x_{12}beta_{12}=y_{1}$
$beta_{21}+x_{22}beta_{22}=y_{2}$
That's equivalent to
$$begin{bmatrix}1 & x_{12} & 0 & 0\0 & 0 & 1 & x_{22}end{bmatrix}begin{bmatrix}beta_{11} \ beta_{12} \ beta_{21} \ beta_{22}end{bmatrix} = begin{bmatrix}y_1 \ y_2end{bmatrix}$$
Maybe you can see the resemblance to our definition above: as you seem to know, the matrix can be identified with a linear function $f$ with $(beta_{11}, beta{12}, beta{21}, beta{22})^T$ being its input argument, i.e. something from the "input" space $X$.
Hence we choose the input space $X = mathbb{R}^4$. The output space is determined by $(y_1, y_2)^T$, thus we choose $Y = mathbb{R}^2$. You could also argue with 4 being the number of columns and 2 being the number of rows of the matrix.
Now it's clear that this system matches our definition above with a function $f: mathbb{R}^4 to mathbb{R}^2$. I believe in you being able to give the definition of $f$ yourself.
Beware that the following you said in your post is not true:
(any linear transform = matrix) [sic!]
If the linear transformation (function) is between finite vectorspaces only and you choose a fixed basis, then yes, you can express every such linear function as a matrix. That's the so-called display matrix wrt. chosen bases.
If either $X$ or $Y$ is an infinite vectorspace, i.e. bases are now infinite (totally ordered) sets, then usually display matrices are not defined. The definition of "system of linear equations" given above importantly still makes sense, though, since we nowhere mentioned matrices to begin with.
$endgroup$
$begingroup$
Thanks for this truely fantastic answer. really. I could'nt hope better.
$endgroup$
– Machupicchu
Dec 31 '18 at 15:18
add a comment |
$begingroup$
The OP seems to understand the meaning of linear and affine as used in Linear Algebra well.
A line is the graph of an affine function/transformation, so the related equations are called "linear", as in, related to a line (and before a study of Linear Algebra, the affine function might itself confusingly be referred to as a "linear function").
$endgroup$
1
$begingroup$
what does OP stand for?
$endgroup$
– Machupicchu
Dec 31 '18 at 13:54
$begingroup$
@Machupicchu Original Poster (or sometimes Original Post). In this case it means you.
$endgroup$
– Arthur
Dec 31 '18 at 14:01
add a comment |
$begingroup$
"a matrix multiplication $Ax=b$ is considered linear operation because b is a linear combination of the columns of A (any linear transform = matrix)." :
Actually $Ax=b$ is not an "operation" at all, it's an equation.
If $A$ is a matrix and you define $f(x)=Ax$ then $f$ is linear. It's not "considered linear" because it's a linear combination of whatever; $f$ is linear, because it satisfies the definition: $f(x+y)=f(x)+f(y)$ and $f(lambda x)=lambda f(x)$.
$endgroup$
$begingroup$
yes i know of course that Ax=b is an equation, i was meaning the "operation of matrix vector multiplication"... of course you are being a bit mean here ;)
$endgroup$
– Machupicchu
Dec 31 '18 at 15:09
$begingroup$
what i was meaning by this - maybe clumsy sentence 'considered linear'- is that any time you multiply a vector by a matrix like Ax, the result is a linear combination of columns of A, and this matrix A is corresponding to a linear transform
$endgroup$
– Machupicchu
Dec 31 '18 at 15:15
add a comment |
Your Answer
StackExchange.ifUsing("editor", function () {
return StackExchange.using("mathjaxEditing", function () {
StackExchange.MarkdownEditor.creationCallbacks.add(function (editor, postfix) {
StackExchange.mathjaxEditing.prepareWmdForMathJax(editor, postfix, [["$", "$"], ["\\(","\\)"]]);
});
});
}, "mathjax-editing");
StackExchange.ready(function() {
var channelOptions = {
tags: "".split(" "),
id: "69"
};
initTagRenderer("".split(" "), "".split(" "), channelOptions);
StackExchange.using("externalEditor", function() {
// Have to fire editor after snippets, if snippets enabled
if (StackExchange.settings.snippets.snippetsEnabled) {
StackExchange.using("snippets", function() {
createEditor();
});
}
else {
createEditor();
}
});
function createEditor() {
StackExchange.prepareEditor({
heartbeatType: 'answer',
autoActivateHeartbeat: false,
convertImagesToLinks: true,
noModals: true,
showLowRepImageUploadWarning: true,
reputationToPostImages: 10,
bindNavPrevention: true,
postfix: "",
imageUploader: {
brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
allowUrls: true
},
noCode: true, onDemand: true,
discardSelector: ".discard-answer"
,immediatelyShowMarkdownHelp:true
});
}
});
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
StackExchange.ready(
function () {
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fmath.stackexchange.com%2fquestions%2f3057726%2flinearity-terminology%23new-answer', 'question_page');
}
);
Post as a guest
Required, but never shown
3 Answers
3
active
oldest
votes
3 Answers
3
active
oldest
votes
active
oldest
votes
active
oldest
votes
$begingroup$
Your definition of a linear function is correct if you assume the correct definitions of the "+" and scalar multiplication operations you actually left undefined. Maybe it is beneficial
Let $X, Y$ be two vectorspaces over the same field $mathbb{K}$. A function $f: X to Y$ is called linear iff.
for every $x_1, x_2 in X$ we have: $f(x_1 + x_2) = f(x_1) oplus f(x_2)$
where "+" is the addition from the vectorspace $X$ and $oplus$ the one from $Y$.for every $lambda in mathbb{K}$ and $x in X$ we have: $f(lambda cdot x) = lambda odot f(x)$
where $cdot$ is the scalar multiplication from the vectorspace $X$ and $odot$ the one from $Y$.
A system of linear equations can be defined as
- having two vectorspaces $X, Y$ (e.g. $X = Y = mathbb{R}^2$)
- a linear function $f: X to Y$ between them (e.g. represented by a matrix if $X, Y$ are finite vectorspaces, in your post you called it $X$)
- and a "target" vector $y in Y$ (e.g. your $y = (y_1, y_2)^T$)
- and the problem: Seek $x in X$ such that $f(x) = y$.
The word "linear" in "system of linear equations" just refers to $f$ being linear. It does not necessarily refer to the equations representing linear functions, as you correctly saw, they are not even necessarily linear, but only affine. Indeed note that the terminology "system of equations" only makes sense if $X, Y$ are finite vectorspaces, see also the last paragraph of this post.
Let's take your example:
$beta_{11}+x_{12}beta_{12}=y_{1}$
$beta_{21}+x_{22}beta_{22}=y_{2}$
That's equivalent to
$$begin{bmatrix}1 & x_{12} & 0 & 0\0 & 0 & 1 & x_{22}end{bmatrix}begin{bmatrix}beta_{11} \ beta_{12} \ beta_{21} \ beta_{22}end{bmatrix} = begin{bmatrix}y_1 \ y_2end{bmatrix}$$
Maybe you can see the resemblance to our definition above: as you seem to know, the matrix can be identified with a linear function $f$ with $(beta_{11}, beta{12}, beta{21}, beta{22})^T$ being its input argument, i.e. something from the "input" space $X$.
Hence we choose the input space $X = mathbb{R}^4$. The output space is determined by $(y_1, y_2)^T$, thus we choose $Y = mathbb{R}^2$. You could also argue with 4 being the number of columns and 2 being the number of rows of the matrix.
Now it's clear that this system matches our definition above with a function $f: mathbb{R}^4 to mathbb{R}^2$. I believe in you being able to give the definition of $f$ yourself.
Beware that the following you said in your post is not true:
(any linear transform = matrix) [sic!]
If the linear transformation (function) is between finite vectorspaces only and you choose a fixed basis, then yes, you can express every such linear function as a matrix. That's the so-called display matrix wrt. chosen bases.
If either $X$ or $Y$ is an infinite vectorspace, i.e. bases are now infinite (totally ordered) sets, then usually display matrices are not defined. The definition of "system of linear equations" given above importantly still makes sense, though, since we nowhere mentioned matrices to begin with.
$endgroup$
$begingroup$
Thanks for this truely fantastic answer. really. I could'nt hope better.
$endgroup$
– Machupicchu
Dec 31 '18 at 15:18
add a comment |
$begingroup$
Your definition of a linear function is correct if you assume the correct definitions of the "+" and scalar multiplication operations you actually left undefined. Maybe it is beneficial
Let $X, Y$ be two vectorspaces over the same field $mathbb{K}$. A function $f: X to Y$ is called linear iff.
for every $x_1, x_2 in X$ we have: $f(x_1 + x_2) = f(x_1) oplus f(x_2)$
where "+" is the addition from the vectorspace $X$ and $oplus$ the one from $Y$.for every $lambda in mathbb{K}$ and $x in X$ we have: $f(lambda cdot x) = lambda odot f(x)$
where $cdot$ is the scalar multiplication from the vectorspace $X$ and $odot$ the one from $Y$.
A system of linear equations can be defined as
- having two vectorspaces $X, Y$ (e.g. $X = Y = mathbb{R}^2$)
- a linear function $f: X to Y$ between them (e.g. represented by a matrix if $X, Y$ are finite vectorspaces, in your post you called it $X$)
- and a "target" vector $y in Y$ (e.g. your $y = (y_1, y_2)^T$)
- and the problem: Seek $x in X$ such that $f(x) = y$.
The word "linear" in "system of linear equations" just refers to $f$ being linear. It does not necessarily refer to the equations representing linear functions, as you correctly saw, they are not even necessarily linear, but only affine. Indeed note that the terminology "system of equations" only makes sense if $X, Y$ are finite vectorspaces, see also the last paragraph of this post.
Let's take your example:
$beta_{11}+x_{12}beta_{12}=y_{1}$
$beta_{21}+x_{22}beta_{22}=y_{2}$
That's equivalent to
$$begin{bmatrix}1 & x_{12} & 0 & 0\0 & 0 & 1 & x_{22}end{bmatrix}begin{bmatrix}beta_{11} \ beta_{12} \ beta_{21} \ beta_{22}end{bmatrix} = begin{bmatrix}y_1 \ y_2end{bmatrix}$$
Maybe you can see the resemblance to our definition above: as you seem to know, the matrix can be identified with a linear function $f$ with $(beta_{11}, beta{12}, beta{21}, beta{22})^T$ being its input argument, i.e. something from the "input" space $X$.
Hence we choose the input space $X = mathbb{R}^4$. The output space is determined by $(y_1, y_2)^T$, thus we choose $Y = mathbb{R}^2$. You could also argue with 4 being the number of columns and 2 being the number of rows of the matrix.
Now it's clear that this system matches our definition above with a function $f: mathbb{R}^4 to mathbb{R}^2$. I believe in you being able to give the definition of $f$ yourself.
Beware that the following you said in your post is not true:
(any linear transform = matrix) [sic!]
If the linear transformation (function) is between finite vectorspaces only and you choose a fixed basis, then yes, you can express every such linear function as a matrix. That's the so-called display matrix wrt. chosen bases.
If either $X$ or $Y$ is an infinite vectorspace, i.e. bases are now infinite (totally ordered) sets, then usually display matrices are not defined. The definition of "system of linear equations" given above importantly still makes sense, though, since we nowhere mentioned matrices to begin with.
$endgroup$
$begingroup$
Thanks for this truely fantastic answer. really. I could'nt hope better.
$endgroup$
– Machupicchu
Dec 31 '18 at 15:18
add a comment |
$begingroup$
Your definition of a linear function is correct if you assume the correct definitions of the "+" and scalar multiplication operations you actually left undefined. Maybe it is beneficial
Let $X, Y$ be two vectorspaces over the same field $mathbb{K}$. A function $f: X to Y$ is called linear iff.
for every $x_1, x_2 in X$ we have: $f(x_1 + x_2) = f(x_1) oplus f(x_2)$
where "+" is the addition from the vectorspace $X$ and $oplus$ the one from $Y$.for every $lambda in mathbb{K}$ and $x in X$ we have: $f(lambda cdot x) = lambda odot f(x)$
where $cdot$ is the scalar multiplication from the vectorspace $X$ and $odot$ the one from $Y$.
A system of linear equations can be defined as
- having two vectorspaces $X, Y$ (e.g. $X = Y = mathbb{R}^2$)
- a linear function $f: X to Y$ between them (e.g. represented by a matrix if $X, Y$ are finite vectorspaces, in your post you called it $X$)
- and a "target" vector $y in Y$ (e.g. your $y = (y_1, y_2)^T$)
- and the problem: Seek $x in X$ such that $f(x) = y$.
The word "linear" in "system of linear equations" just refers to $f$ being linear. It does not necessarily refer to the equations representing linear functions, as you correctly saw, they are not even necessarily linear, but only affine. Indeed note that the terminology "system of equations" only makes sense if $X, Y$ are finite vectorspaces, see also the last paragraph of this post.
Let's take your example:
$beta_{11}+x_{12}beta_{12}=y_{1}$
$beta_{21}+x_{22}beta_{22}=y_{2}$
That's equivalent to
$$begin{bmatrix}1 & x_{12} & 0 & 0\0 & 0 & 1 & x_{22}end{bmatrix}begin{bmatrix}beta_{11} \ beta_{12} \ beta_{21} \ beta_{22}end{bmatrix} = begin{bmatrix}y_1 \ y_2end{bmatrix}$$
Maybe you can see the resemblance to our definition above: as you seem to know, the matrix can be identified with a linear function $f$ with $(beta_{11}, beta{12}, beta{21}, beta{22})^T$ being its input argument, i.e. something from the "input" space $X$.
Hence we choose the input space $X = mathbb{R}^4$. The output space is determined by $(y_1, y_2)^T$, thus we choose $Y = mathbb{R}^2$. You could also argue with 4 being the number of columns and 2 being the number of rows of the matrix.
Now it's clear that this system matches our definition above with a function $f: mathbb{R}^4 to mathbb{R}^2$. I believe in you being able to give the definition of $f$ yourself.
Beware that the following you said in your post is not true:
(any linear transform = matrix) [sic!]
If the linear transformation (function) is between finite vectorspaces only and you choose a fixed basis, then yes, you can express every such linear function as a matrix. That's the so-called display matrix wrt. chosen bases.
If either $X$ or $Y$ is an infinite vectorspace, i.e. bases are now infinite (totally ordered) sets, then usually display matrices are not defined. The definition of "system of linear equations" given above importantly still makes sense, though, since we nowhere mentioned matrices to begin with.
$endgroup$
Your definition of a linear function is correct if you assume the correct definitions of the "+" and scalar multiplication operations you actually left undefined. Maybe it is beneficial
Let $X, Y$ be two vectorspaces over the same field $mathbb{K}$. A function $f: X to Y$ is called linear iff.
for every $x_1, x_2 in X$ we have: $f(x_1 + x_2) = f(x_1) oplus f(x_2)$
where "+" is the addition from the vectorspace $X$ and $oplus$ the one from $Y$.for every $lambda in mathbb{K}$ and $x in X$ we have: $f(lambda cdot x) = lambda odot f(x)$
where $cdot$ is the scalar multiplication from the vectorspace $X$ and $odot$ the one from $Y$.
A system of linear equations can be defined as
- having two vectorspaces $X, Y$ (e.g. $X = Y = mathbb{R}^2$)
- a linear function $f: X to Y$ between them (e.g. represented by a matrix if $X, Y$ are finite vectorspaces, in your post you called it $X$)
- and a "target" vector $y in Y$ (e.g. your $y = (y_1, y_2)^T$)
- and the problem: Seek $x in X$ such that $f(x) = y$.
The word "linear" in "system of linear equations" just refers to $f$ being linear. It does not necessarily refer to the equations representing linear functions, as you correctly saw, they are not even necessarily linear, but only affine. Indeed note that the terminology "system of equations" only makes sense if $X, Y$ are finite vectorspaces, see also the last paragraph of this post.
Let's take your example:
$beta_{11}+x_{12}beta_{12}=y_{1}$
$beta_{21}+x_{22}beta_{22}=y_{2}$
That's equivalent to
$$begin{bmatrix}1 & x_{12} & 0 & 0\0 & 0 & 1 & x_{22}end{bmatrix}begin{bmatrix}beta_{11} \ beta_{12} \ beta_{21} \ beta_{22}end{bmatrix} = begin{bmatrix}y_1 \ y_2end{bmatrix}$$
Maybe you can see the resemblance to our definition above: as you seem to know, the matrix can be identified with a linear function $f$ with $(beta_{11}, beta{12}, beta{21}, beta{22})^T$ being its input argument, i.e. something from the "input" space $X$.
Hence we choose the input space $X = mathbb{R}^4$. The output space is determined by $(y_1, y_2)^T$, thus we choose $Y = mathbb{R}^2$. You could also argue with 4 being the number of columns and 2 being the number of rows of the matrix.
Now it's clear that this system matches our definition above with a function $f: mathbb{R}^4 to mathbb{R}^2$. I believe in you being able to give the definition of $f$ yourself.
Beware that the following you said in your post is not true:
(any linear transform = matrix) [sic!]
If the linear transformation (function) is between finite vectorspaces only and you choose a fixed basis, then yes, you can express every such linear function as a matrix. That's the so-called display matrix wrt. chosen bases.
If either $X$ or $Y$ is an infinite vectorspace, i.e. bases are now infinite (totally ordered) sets, then usually display matrices are not defined. The definition of "system of linear equations" given above importantly still makes sense, though, since we nowhere mentioned matrices to begin with.
edited Dec 31 '18 at 15:27
answered Dec 31 '18 at 14:03
ComFreekComFreek
5521411
5521411
$begingroup$
Thanks for this truely fantastic answer. really. I could'nt hope better.
$endgroup$
– Machupicchu
Dec 31 '18 at 15:18
add a comment |
$begingroup$
Thanks for this truely fantastic answer. really. I could'nt hope better.
$endgroup$
– Machupicchu
Dec 31 '18 at 15:18
$begingroup$
Thanks for this truely fantastic answer. really. I could'nt hope better.
$endgroup$
– Machupicchu
Dec 31 '18 at 15:18
$begingroup$
Thanks for this truely fantastic answer. really. I could'nt hope better.
$endgroup$
– Machupicchu
Dec 31 '18 at 15:18
add a comment |
$begingroup$
The OP seems to understand the meaning of linear and affine as used in Linear Algebra well.
A line is the graph of an affine function/transformation, so the related equations are called "linear", as in, related to a line (and before a study of Linear Algebra, the affine function might itself confusingly be referred to as a "linear function").
$endgroup$
1
$begingroup$
what does OP stand for?
$endgroup$
– Machupicchu
Dec 31 '18 at 13:54
$begingroup$
@Machupicchu Original Poster (or sometimes Original Post). In this case it means you.
$endgroup$
– Arthur
Dec 31 '18 at 14:01
add a comment |
$begingroup$
The OP seems to understand the meaning of linear and affine as used in Linear Algebra well.
A line is the graph of an affine function/transformation, so the related equations are called "linear", as in, related to a line (and before a study of Linear Algebra, the affine function might itself confusingly be referred to as a "linear function").
$endgroup$
1
$begingroup$
what does OP stand for?
$endgroup$
– Machupicchu
Dec 31 '18 at 13:54
$begingroup$
@Machupicchu Original Poster (or sometimes Original Post). In this case it means you.
$endgroup$
– Arthur
Dec 31 '18 at 14:01
add a comment |
$begingroup$
The OP seems to understand the meaning of linear and affine as used in Linear Algebra well.
A line is the graph of an affine function/transformation, so the related equations are called "linear", as in, related to a line (and before a study of Linear Algebra, the affine function might itself confusingly be referred to as a "linear function").
$endgroup$
The OP seems to understand the meaning of linear and affine as used in Linear Algebra well.
A line is the graph of an affine function/transformation, so the related equations are called "linear", as in, related to a line (and before a study of Linear Algebra, the affine function might itself confusingly be referred to as a "linear function").
answered Dec 31 '18 at 13:50
Mark S.Mark S.
12.2k22771
12.2k22771
1
$begingroup$
what does OP stand for?
$endgroup$
– Machupicchu
Dec 31 '18 at 13:54
$begingroup$
@Machupicchu Original Poster (or sometimes Original Post). In this case it means you.
$endgroup$
– Arthur
Dec 31 '18 at 14:01
add a comment |
1
$begingroup$
what does OP stand for?
$endgroup$
– Machupicchu
Dec 31 '18 at 13:54
$begingroup$
@Machupicchu Original Poster (or sometimes Original Post). In this case it means you.
$endgroup$
– Arthur
Dec 31 '18 at 14:01
1
1
$begingroup$
what does OP stand for?
$endgroup$
– Machupicchu
Dec 31 '18 at 13:54
$begingroup$
what does OP stand for?
$endgroup$
– Machupicchu
Dec 31 '18 at 13:54
$begingroup$
@Machupicchu Original Poster (or sometimes Original Post). In this case it means you.
$endgroup$
– Arthur
Dec 31 '18 at 14:01
$begingroup$
@Machupicchu Original Poster (or sometimes Original Post). In this case it means you.
$endgroup$
– Arthur
Dec 31 '18 at 14:01
add a comment |
$begingroup$
"a matrix multiplication $Ax=b$ is considered linear operation because b is a linear combination of the columns of A (any linear transform = matrix)." :
Actually $Ax=b$ is not an "operation" at all, it's an equation.
If $A$ is a matrix and you define $f(x)=Ax$ then $f$ is linear. It's not "considered linear" because it's a linear combination of whatever; $f$ is linear, because it satisfies the definition: $f(x+y)=f(x)+f(y)$ and $f(lambda x)=lambda f(x)$.
$endgroup$
$begingroup$
yes i know of course that Ax=b is an equation, i was meaning the "operation of matrix vector multiplication"... of course you are being a bit mean here ;)
$endgroup$
– Machupicchu
Dec 31 '18 at 15:09
$begingroup$
what i was meaning by this - maybe clumsy sentence 'considered linear'- is that any time you multiply a vector by a matrix like Ax, the result is a linear combination of columns of A, and this matrix A is corresponding to a linear transform
$endgroup$
– Machupicchu
Dec 31 '18 at 15:15
add a comment |
$begingroup$
"a matrix multiplication $Ax=b$ is considered linear operation because b is a linear combination of the columns of A (any linear transform = matrix)." :
Actually $Ax=b$ is not an "operation" at all, it's an equation.
If $A$ is a matrix and you define $f(x)=Ax$ then $f$ is linear. It's not "considered linear" because it's a linear combination of whatever; $f$ is linear, because it satisfies the definition: $f(x+y)=f(x)+f(y)$ and $f(lambda x)=lambda f(x)$.
$endgroup$
$begingroup$
yes i know of course that Ax=b is an equation, i was meaning the "operation of matrix vector multiplication"... of course you are being a bit mean here ;)
$endgroup$
– Machupicchu
Dec 31 '18 at 15:09
$begingroup$
what i was meaning by this - maybe clumsy sentence 'considered linear'- is that any time you multiply a vector by a matrix like Ax, the result is a linear combination of columns of A, and this matrix A is corresponding to a linear transform
$endgroup$
– Machupicchu
Dec 31 '18 at 15:15
add a comment |
$begingroup$
"a matrix multiplication $Ax=b$ is considered linear operation because b is a linear combination of the columns of A (any linear transform = matrix)." :
Actually $Ax=b$ is not an "operation" at all, it's an equation.
If $A$ is a matrix and you define $f(x)=Ax$ then $f$ is linear. It's not "considered linear" because it's a linear combination of whatever; $f$ is linear, because it satisfies the definition: $f(x+y)=f(x)+f(y)$ and $f(lambda x)=lambda f(x)$.
$endgroup$
"a matrix multiplication $Ax=b$ is considered linear operation because b is a linear combination of the columns of A (any linear transform = matrix)." :
Actually $Ax=b$ is not an "operation" at all, it's an equation.
If $A$ is a matrix and you define $f(x)=Ax$ then $f$ is linear. It's not "considered linear" because it's a linear combination of whatever; $f$ is linear, because it satisfies the definition: $f(x+y)=f(x)+f(y)$ and $f(lambda x)=lambda f(x)$.
answered Dec 31 '18 at 14:22
David C. UllrichDavid C. Ullrich
61.6k43995
61.6k43995
$begingroup$
yes i know of course that Ax=b is an equation, i was meaning the "operation of matrix vector multiplication"... of course you are being a bit mean here ;)
$endgroup$
– Machupicchu
Dec 31 '18 at 15:09
$begingroup$
what i was meaning by this - maybe clumsy sentence 'considered linear'- is that any time you multiply a vector by a matrix like Ax, the result is a linear combination of columns of A, and this matrix A is corresponding to a linear transform
$endgroup$
– Machupicchu
Dec 31 '18 at 15:15
add a comment |
$begingroup$
yes i know of course that Ax=b is an equation, i was meaning the "operation of matrix vector multiplication"... of course you are being a bit mean here ;)
$endgroup$
– Machupicchu
Dec 31 '18 at 15:09
$begingroup$
what i was meaning by this - maybe clumsy sentence 'considered linear'- is that any time you multiply a vector by a matrix like Ax, the result is a linear combination of columns of A, and this matrix A is corresponding to a linear transform
$endgroup$
– Machupicchu
Dec 31 '18 at 15:15
$begingroup$
yes i know of course that Ax=b is an equation, i was meaning the "operation of matrix vector multiplication"... of course you are being a bit mean here ;)
$endgroup$
– Machupicchu
Dec 31 '18 at 15:09
$begingroup$
yes i know of course that Ax=b is an equation, i was meaning the "operation of matrix vector multiplication"... of course you are being a bit mean here ;)
$endgroup$
– Machupicchu
Dec 31 '18 at 15:09
$begingroup$
what i was meaning by this - maybe clumsy sentence 'considered linear'- is that any time you multiply a vector by a matrix like Ax, the result is a linear combination of columns of A, and this matrix A is corresponding to a linear transform
$endgroup$
– Machupicchu
Dec 31 '18 at 15:15
$begingroup$
what i was meaning by this - maybe clumsy sentence 'considered linear'- is that any time you multiply a vector by a matrix like Ax, the result is a linear combination of columns of A, and this matrix A is corresponding to a linear transform
$endgroup$
– Machupicchu
Dec 31 '18 at 15:15
add a comment |
Thanks for contributing an answer to Mathematics Stack Exchange!
- Please be sure to answer the question. Provide details and share your research!
But avoid …
- Asking for help, clarification, or responding to other answers.
- Making statements based on opinion; back them up with references or personal experience.
Use MathJax to format equations. MathJax reference.
To learn more, see our tips on writing great answers.
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
StackExchange.ready(
function () {
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fmath.stackexchange.com%2fquestions%2f3057726%2flinearity-terminology%23new-answer', 'question_page');
}
);
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
1
$begingroup$
You could also just think of the equations as being in the form $$textrm{linear expression}=textrm{value}$$ so the name comes from what is on the left side.
$endgroup$
– MPW
Dec 31 '18 at 13:54
$begingroup$
Ok, and in a sense as we "plug " into the matrix a column of 1's to get our intercept, then this becomes a linear combination of the columns you would say, and in that sense we could accept the definition of "linear"? the intercept is "linearized" so to speak :D
$endgroup$
– Machupicchu
Dec 31 '18 at 13:56