What is the meaning of stacked integers in linear algebra notation?











up vote
1
down vote

favorite












I was recently reading a stack exchange post on solving a polynomial regression problem with gradient descent, and one contributor identified the objective function as:



$$text{argmin}_{beta} | mathbf{y - Xbeta}|_2^2$$



They identified this as the "squared loss" i.e. L2 Loss function, which makes sense.



The notation in the equation, however, is new to me. Why are the 2's stacked atop each other in this equation? What do those 2's signify? What is the difference between the formulation above and:



$$text{argmin}_{beta} | mathbf{y - Xbeta}|^2$$



I'd be very grateful for any help others can offer with this question.










share|cite|improve this question






















  • It's just the square of the $ell^2$ norm.
    – anomaly
    Nov 16 at 1:35












  • One is a subscript (indicating the particular norm), one is a square.
    – Randall
    Nov 16 at 1:36












  • Hmm, I'm still not following. What is the subscript for? Why is the subscript a 2? Sorry I'm pretty daft (I was an English major so have to learn all this on my own)
    – duhaime
    Nov 16 at 1:38






  • 1




    See: en.wikipedia.org/wiki/List_of_Banach_spaces
    – Randall
    Nov 16 at 1:42










  • Thanks @Randall, that link and zahbaz's note below cleared this up for me
    – duhaime
    Nov 16 at 1:50















up vote
1
down vote

favorite












I was recently reading a stack exchange post on solving a polynomial regression problem with gradient descent, and one contributor identified the objective function as:



$$text{argmin}_{beta} | mathbf{y - Xbeta}|_2^2$$



They identified this as the "squared loss" i.e. L2 Loss function, which makes sense.



The notation in the equation, however, is new to me. Why are the 2's stacked atop each other in this equation? What do those 2's signify? What is the difference between the formulation above and:



$$text{argmin}_{beta} | mathbf{y - Xbeta}|^2$$



I'd be very grateful for any help others can offer with this question.










share|cite|improve this question






















  • It's just the square of the $ell^2$ norm.
    – anomaly
    Nov 16 at 1:35












  • One is a subscript (indicating the particular norm), one is a square.
    – Randall
    Nov 16 at 1:36












  • Hmm, I'm still not following. What is the subscript for? Why is the subscript a 2? Sorry I'm pretty daft (I was an English major so have to learn all this on my own)
    – duhaime
    Nov 16 at 1:38






  • 1




    See: en.wikipedia.org/wiki/List_of_Banach_spaces
    – Randall
    Nov 16 at 1:42










  • Thanks @Randall, that link and zahbaz's note below cleared this up for me
    – duhaime
    Nov 16 at 1:50













up vote
1
down vote

favorite









up vote
1
down vote

favorite











I was recently reading a stack exchange post on solving a polynomial regression problem with gradient descent, and one contributor identified the objective function as:



$$text{argmin}_{beta} | mathbf{y - Xbeta}|_2^2$$



They identified this as the "squared loss" i.e. L2 Loss function, which makes sense.



The notation in the equation, however, is new to me. Why are the 2's stacked atop each other in this equation? What do those 2's signify? What is the difference between the formulation above and:



$$text{argmin}_{beta} | mathbf{y - Xbeta}|^2$$



I'd be very grateful for any help others can offer with this question.










share|cite|improve this question













I was recently reading a stack exchange post on solving a polynomial regression problem with gradient descent, and one contributor identified the objective function as:



$$text{argmin}_{beta} | mathbf{y - Xbeta}|_2^2$$



They identified this as the "squared loss" i.e. L2 Loss function, which makes sense.



The notation in the equation, however, is new to me. Why are the 2's stacked atop each other in this equation? What do those 2's signify? What is the difference between the formulation above and:



$$text{argmin}_{beta} | mathbf{y - Xbeta}|^2$$



I'd be very grateful for any help others can offer with this question.







linear-algebra optimization notation






share|cite|improve this question













share|cite|improve this question











share|cite|improve this question




share|cite|improve this question










asked Nov 16 at 1:33









duhaime

1427




1427












  • It's just the square of the $ell^2$ norm.
    – anomaly
    Nov 16 at 1:35












  • One is a subscript (indicating the particular norm), one is a square.
    – Randall
    Nov 16 at 1:36












  • Hmm, I'm still not following. What is the subscript for? Why is the subscript a 2? Sorry I'm pretty daft (I was an English major so have to learn all this on my own)
    – duhaime
    Nov 16 at 1:38






  • 1




    See: en.wikipedia.org/wiki/List_of_Banach_spaces
    – Randall
    Nov 16 at 1:42










  • Thanks @Randall, that link and zahbaz's note below cleared this up for me
    – duhaime
    Nov 16 at 1:50


















  • It's just the square of the $ell^2$ norm.
    – anomaly
    Nov 16 at 1:35












  • One is a subscript (indicating the particular norm), one is a square.
    – Randall
    Nov 16 at 1:36












  • Hmm, I'm still not following. What is the subscript for? Why is the subscript a 2? Sorry I'm pretty daft (I was an English major so have to learn all this on my own)
    – duhaime
    Nov 16 at 1:38






  • 1




    See: en.wikipedia.org/wiki/List_of_Banach_spaces
    – Randall
    Nov 16 at 1:42










  • Thanks @Randall, that link and zahbaz's note below cleared this up for me
    – duhaime
    Nov 16 at 1:50
















It's just the square of the $ell^2$ norm.
– anomaly
Nov 16 at 1:35






It's just the square of the $ell^2$ norm.
– anomaly
Nov 16 at 1:35














One is a subscript (indicating the particular norm), one is a square.
– Randall
Nov 16 at 1:36






One is a subscript (indicating the particular norm), one is a square.
– Randall
Nov 16 at 1:36














Hmm, I'm still not following. What is the subscript for? Why is the subscript a 2? Sorry I'm pretty daft (I was an English major so have to learn all this on my own)
– duhaime
Nov 16 at 1:38




Hmm, I'm still not following. What is the subscript for? Why is the subscript a 2? Sorry I'm pretty daft (I was an English major so have to learn all this on my own)
– duhaime
Nov 16 at 1:38




1




1




See: en.wikipedia.org/wiki/List_of_Banach_spaces
– Randall
Nov 16 at 1:42




See: en.wikipedia.org/wiki/List_of_Banach_spaces
– Randall
Nov 16 at 1:42












Thanks @Randall, that link and zahbaz's note below cleared this up for me
– duhaime
Nov 16 at 1:50




Thanks @Randall, that link and zahbaz's note below cleared this up for me
– duhaime
Nov 16 at 1:50










2 Answers
2






active

oldest

votes

















up vote
1
down vote



accepted










The subscript is indicating the type of norm. For instance, the finite dimensional Lp norms are given by



$$||vec{x}||_p = left(sum_{i=1}^n |x_i|^p right)^{1/p}$$



The superscript is simply the square. So, the L2 norm is then



$$||vec{x}||_2 = left(sum_{i=1}^n |x_i|^2 right)^{1/2}$$



$$||vec{x}||_2^2 = sum_{i=1}^n x_i^2.$$



The main idea here is that there are many types of norms, and we often want to specify which one it is that we're using.






share|cite|improve this answer




























    up vote
    0
    down vote













    It just ensures you are not doing something like taking the Lp norm and then squaring it. That would have a subscript p and exponent 2. In this case, you get extra clarity that is L2 as usual. That is just an example showing how the subscript provides extra (usually unnecessary) clarification.






    share|cite|improve this answer





















      Your Answer





      StackExchange.ifUsing("editor", function () {
      return StackExchange.using("mathjaxEditing", function () {
      StackExchange.MarkdownEditor.creationCallbacks.add(function (editor, postfix) {
      StackExchange.mathjaxEditing.prepareWmdForMathJax(editor, postfix, [["$", "$"], ["\\(","\\)"]]);
      });
      });
      }, "mathjax-editing");

      StackExchange.ready(function() {
      var channelOptions = {
      tags: "".split(" "),
      id: "69"
      };
      initTagRenderer("".split(" "), "".split(" "), channelOptions);

      StackExchange.using("externalEditor", function() {
      // Have to fire editor after snippets, if snippets enabled
      if (StackExchange.settings.snippets.snippetsEnabled) {
      StackExchange.using("snippets", function() {
      createEditor();
      });
      }
      else {
      createEditor();
      }
      });

      function createEditor() {
      StackExchange.prepareEditor({
      heartbeatType: 'answer',
      convertImagesToLinks: true,
      noModals: true,
      showLowRepImageUploadWarning: true,
      reputationToPostImages: 10,
      bindNavPrevention: true,
      postfix: "",
      imageUploader: {
      brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
      contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
      allowUrls: true
      },
      noCode: true, onDemand: true,
      discardSelector: ".discard-answer"
      ,immediatelyShowMarkdownHelp:true
      });


      }
      });














       

      draft saved


      draft discarded


















      StackExchange.ready(
      function () {
      StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fmath.stackexchange.com%2fquestions%2f3000593%2fwhat-is-the-meaning-of-stacked-integers-in-linear-algebra-notation%23new-answer', 'question_page');
      }
      );

      Post as a guest















      Required, but never shown

























      2 Answers
      2






      active

      oldest

      votes








      2 Answers
      2






      active

      oldest

      votes









      active

      oldest

      votes






      active

      oldest

      votes








      up vote
      1
      down vote



      accepted










      The subscript is indicating the type of norm. For instance, the finite dimensional Lp norms are given by



      $$||vec{x}||_p = left(sum_{i=1}^n |x_i|^p right)^{1/p}$$



      The superscript is simply the square. So, the L2 norm is then



      $$||vec{x}||_2 = left(sum_{i=1}^n |x_i|^2 right)^{1/2}$$



      $$||vec{x}||_2^2 = sum_{i=1}^n x_i^2.$$



      The main idea here is that there are many types of norms, and we often want to specify which one it is that we're using.






      share|cite|improve this answer

























        up vote
        1
        down vote



        accepted










        The subscript is indicating the type of norm. For instance, the finite dimensional Lp norms are given by



        $$||vec{x}||_p = left(sum_{i=1}^n |x_i|^p right)^{1/p}$$



        The superscript is simply the square. So, the L2 norm is then



        $$||vec{x}||_2 = left(sum_{i=1}^n |x_i|^2 right)^{1/2}$$



        $$||vec{x}||_2^2 = sum_{i=1}^n x_i^2.$$



        The main idea here is that there are many types of norms, and we often want to specify which one it is that we're using.






        share|cite|improve this answer























          up vote
          1
          down vote



          accepted







          up vote
          1
          down vote



          accepted






          The subscript is indicating the type of norm. For instance, the finite dimensional Lp norms are given by



          $$||vec{x}||_p = left(sum_{i=1}^n |x_i|^p right)^{1/p}$$



          The superscript is simply the square. So, the L2 norm is then



          $$||vec{x}||_2 = left(sum_{i=1}^n |x_i|^2 right)^{1/2}$$



          $$||vec{x}||_2^2 = sum_{i=1}^n x_i^2.$$



          The main idea here is that there are many types of norms, and we often want to specify which one it is that we're using.






          share|cite|improve this answer












          The subscript is indicating the type of norm. For instance, the finite dimensional Lp norms are given by



          $$||vec{x}||_p = left(sum_{i=1}^n |x_i|^p right)^{1/p}$$



          The superscript is simply the square. So, the L2 norm is then



          $$||vec{x}||_2 = left(sum_{i=1}^n |x_i|^2 right)^{1/2}$$



          $$||vec{x}||_2^2 = sum_{i=1}^n x_i^2.$$



          The main idea here is that there are many types of norms, and we often want to specify which one it is that we're using.







          share|cite|improve this answer












          share|cite|improve this answer



          share|cite|improve this answer










          answered Nov 16 at 1:45









          zahbaz

          8,14921937




          8,14921937






















              up vote
              0
              down vote













              It just ensures you are not doing something like taking the Lp norm and then squaring it. That would have a subscript p and exponent 2. In this case, you get extra clarity that is L2 as usual. That is just an example showing how the subscript provides extra (usually unnecessary) clarification.






              share|cite|improve this answer

























                up vote
                0
                down vote













                It just ensures you are not doing something like taking the Lp norm and then squaring it. That would have a subscript p and exponent 2. In this case, you get extra clarity that is L2 as usual. That is just an example showing how the subscript provides extra (usually unnecessary) clarification.






                share|cite|improve this answer























                  up vote
                  0
                  down vote










                  up vote
                  0
                  down vote









                  It just ensures you are not doing something like taking the Lp norm and then squaring it. That would have a subscript p and exponent 2. In this case, you get extra clarity that is L2 as usual. That is just an example showing how the subscript provides extra (usually unnecessary) clarification.






                  share|cite|improve this answer












                  It just ensures you are not doing something like taking the Lp norm and then squaring it. That would have a subscript p and exponent 2. In this case, you get extra clarity that is L2 as usual. That is just an example showing how the subscript provides extra (usually unnecessary) clarification.







                  share|cite|improve this answer












                  share|cite|improve this answer



                  share|cite|improve this answer










                  answered Nov 16 at 1:39









                  AHusain

                  2,7602816




                  2,7602816






























                       

                      draft saved


                      draft discarded



















































                       


                      draft saved


                      draft discarded














                      StackExchange.ready(
                      function () {
                      StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fmath.stackexchange.com%2fquestions%2f3000593%2fwhat-is-the-meaning-of-stacked-integers-in-linear-algebra-notation%23new-answer', 'question_page');
                      }
                      );

                      Post as a guest















                      Required, but never shown





















































                      Required, but never shown














                      Required, but never shown












                      Required, but never shown







                      Required, but never shown

































                      Required, but never shown














                      Required, but never shown












                      Required, but never shown







                      Required, but never shown







                      Popular posts from this blog

                      Probability when a professor distributes a quiz and homework assignment to a class of n students.

                      Aardman Animations

                      Are they similar matrix