Looping over pandas DataFrame





.everyoneloves__top-leaderboard:empty,.everyoneloves__mid-leaderboard:empty,.everyoneloves__bot-mid-leaderboard:empty{ height:90px;width:728px;box-sizing:border-box;
}







0















I have a weird issue that the result doesn't change for each iteration. The code is the following:



import pandas as pd
import numpy as np

X = np.arange(10,100)
Y = X[::-1]
Z = np.array([X,Y]).T

df = pd.DataFrame(Z ,columns = ['col1','col2'])
dif = df['col1'] - df['col2']

for gap in range(100):
Up = dif > gap
Down = dif < -gap

df.loc[Up,'predict'] = 'Up'
df.loc[Down,'predict'] = 'Down'

df_result = df.dropna()
Total = df.shape[0]
count = df_result.shape[0]
ratio = count/Total
print(f'Total: {Total}; count: {count}; ratio: {ratio}')


The result is always



Total: 90; count: 90; ratio: 1.0


when it shouldn't be. Thank you in advance










share|improve this question





























    0















    I have a weird issue that the result doesn't change for each iteration. The code is the following:



    import pandas as pd
    import numpy as np

    X = np.arange(10,100)
    Y = X[::-1]
    Z = np.array([X,Y]).T

    df = pd.DataFrame(Z ,columns = ['col1','col2'])
    dif = df['col1'] - df['col2']

    for gap in range(100):
    Up = dif > gap
    Down = dif < -gap

    df.loc[Up,'predict'] = 'Up'
    df.loc[Down,'predict'] = 'Down'

    df_result = df.dropna()
    Total = df.shape[0]
    count = df_result.shape[0]
    ratio = count/Total
    print(f'Total: {Total}; count: {count}; ratio: {ratio}')


    The result is always



    Total: 90; count: 90; ratio: 1.0


    when it shouldn't be. Thank you in advance










    share|improve this question

























      0












      0








      0








      I have a weird issue that the result doesn't change for each iteration. The code is the following:



      import pandas as pd
      import numpy as np

      X = np.arange(10,100)
      Y = X[::-1]
      Z = np.array([X,Y]).T

      df = pd.DataFrame(Z ,columns = ['col1','col2'])
      dif = df['col1'] - df['col2']

      for gap in range(100):
      Up = dif > gap
      Down = dif < -gap

      df.loc[Up,'predict'] = 'Up'
      df.loc[Down,'predict'] = 'Down'

      df_result = df.dropna()
      Total = df.shape[0]
      count = df_result.shape[0]
      ratio = count/Total
      print(f'Total: {Total}; count: {count}; ratio: {ratio}')


      The result is always



      Total: 90; count: 90; ratio: 1.0


      when it shouldn't be. Thank you in advance










      share|improve this question














      I have a weird issue that the result doesn't change for each iteration. The code is the following:



      import pandas as pd
      import numpy as np

      X = np.arange(10,100)
      Y = X[::-1]
      Z = np.array([X,Y]).T

      df = pd.DataFrame(Z ,columns = ['col1','col2'])
      dif = df['col1'] - df['col2']

      for gap in range(100):
      Up = dif > gap
      Down = dif < -gap

      df.loc[Up,'predict'] = 'Up'
      df.loc[Down,'predict'] = 'Down'

      df_result = df.dropna()
      Total = df.shape[0]
      count = df_result.shape[0]
      ratio = count/Total
      print(f'Total: {Total}; count: {count}; ratio: {ratio}')


      The result is always



      Total: 90; count: 90; ratio: 1.0


      when it shouldn't be. Thank you in advance







      python-3.x pandas loops numpy na






      share|improve this question













      share|improve this question











      share|improve this question




      share|improve this question










      asked Nov 25 '18 at 9:11









      mathguymathguy

      1197




      1197
























          1 Answer
          1






          active

          oldest

          votes


















          1














          Found the root of the problem 5 mins after posting this question. I just needed to reset the dataFrame to the original to fix the problem.



          import pandas as pd
          import numpy as np

          X = np.arange(10,100)
          Y = X[::-1]
          Z = np.array([X,Y]).T

          df = pd.DataFrame(Z ,columns = ['col1','col2'])
          df2 = df.copy()#added this line to preserve the original df
          dif = df['col1'] - df['col2']

          for gap in range(100):
          df = df2.copy()#reset the altered df back to the original
          Up = dif > gap
          Down = dif < -gap

          df.loc[Up,'predict'] = 'Up'
          df.loc[Down,'predict'] = 'Down'

          df_result = df.dropna()
          Total = df.shape[0]
          count = df_result.shape[0]
          ratio = count/Total
          print(f'Total: {Total}; count: {count}; ratio: {ratio}')





          share|improve this answer
























            Your Answer






            StackExchange.ifUsing("editor", function () {
            StackExchange.using("externalEditor", function () {
            StackExchange.using("snippets", function () {
            StackExchange.snippets.init();
            });
            });
            }, "code-snippets");

            StackExchange.ready(function() {
            var channelOptions = {
            tags: "".split(" "),
            id: "1"
            };
            initTagRenderer("".split(" "), "".split(" "), channelOptions);

            StackExchange.using("externalEditor", function() {
            // Have to fire editor after snippets, if snippets enabled
            if (StackExchange.settings.snippets.snippetsEnabled) {
            StackExchange.using("snippets", function() {
            createEditor();
            });
            }
            else {
            createEditor();
            }
            });

            function createEditor() {
            StackExchange.prepareEditor({
            heartbeatType: 'answer',
            autoActivateHeartbeat: false,
            convertImagesToLinks: true,
            noModals: true,
            showLowRepImageUploadWarning: true,
            reputationToPostImages: 10,
            bindNavPrevention: true,
            postfix: "",
            imageUploader: {
            brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
            contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
            allowUrls: true
            },
            onDemand: true,
            discardSelector: ".discard-answer"
            ,immediatelyShowMarkdownHelp:true
            });


            }
            });














            draft saved

            draft discarded


















            StackExchange.ready(
            function () {
            StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fstackoverflow.com%2fquestions%2f53466086%2flooping-over-pandas-dataframe%23new-answer', 'question_page');
            }
            );

            Post as a guest















            Required, but never shown

























            1 Answer
            1






            active

            oldest

            votes








            1 Answer
            1






            active

            oldest

            votes









            active

            oldest

            votes






            active

            oldest

            votes









            1














            Found the root of the problem 5 mins after posting this question. I just needed to reset the dataFrame to the original to fix the problem.



            import pandas as pd
            import numpy as np

            X = np.arange(10,100)
            Y = X[::-1]
            Z = np.array([X,Y]).T

            df = pd.DataFrame(Z ,columns = ['col1','col2'])
            df2 = df.copy()#added this line to preserve the original df
            dif = df['col1'] - df['col2']

            for gap in range(100):
            df = df2.copy()#reset the altered df back to the original
            Up = dif > gap
            Down = dif < -gap

            df.loc[Up,'predict'] = 'Up'
            df.loc[Down,'predict'] = 'Down'

            df_result = df.dropna()
            Total = df.shape[0]
            count = df_result.shape[0]
            ratio = count/Total
            print(f'Total: {Total}; count: {count}; ratio: {ratio}')





            share|improve this answer




























              1














              Found the root of the problem 5 mins after posting this question. I just needed to reset the dataFrame to the original to fix the problem.



              import pandas as pd
              import numpy as np

              X = np.arange(10,100)
              Y = X[::-1]
              Z = np.array([X,Y]).T

              df = pd.DataFrame(Z ,columns = ['col1','col2'])
              df2 = df.copy()#added this line to preserve the original df
              dif = df['col1'] - df['col2']

              for gap in range(100):
              df = df2.copy()#reset the altered df back to the original
              Up = dif > gap
              Down = dif < -gap

              df.loc[Up,'predict'] = 'Up'
              df.loc[Down,'predict'] = 'Down'

              df_result = df.dropna()
              Total = df.shape[0]
              count = df_result.shape[0]
              ratio = count/Total
              print(f'Total: {Total}; count: {count}; ratio: {ratio}')





              share|improve this answer


























                1












                1








                1







                Found the root of the problem 5 mins after posting this question. I just needed to reset the dataFrame to the original to fix the problem.



                import pandas as pd
                import numpy as np

                X = np.arange(10,100)
                Y = X[::-1]
                Z = np.array([X,Y]).T

                df = pd.DataFrame(Z ,columns = ['col1','col2'])
                df2 = df.copy()#added this line to preserve the original df
                dif = df['col1'] - df['col2']

                for gap in range(100):
                df = df2.copy()#reset the altered df back to the original
                Up = dif > gap
                Down = dif < -gap

                df.loc[Up,'predict'] = 'Up'
                df.loc[Down,'predict'] = 'Down'

                df_result = df.dropna()
                Total = df.shape[0]
                count = df_result.shape[0]
                ratio = count/Total
                print(f'Total: {Total}; count: {count}; ratio: {ratio}')





                share|improve this answer













                Found the root of the problem 5 mins after posting this question. I just needed to reset the dataFrame to the original to fix the problem.



                import pandas as pd
                import numpy as np

                X = np.arange(10,100)
                Y = X[::-1]
                Z = np.array([X,Y]).T

                df = pd.DataFrame(Z ,columns = ['col1','col2'])
                df2 = df.copy()#added this line to preserve the original df
                dif = df['col1'] - df['col2']

                for gap in range(100):
                df = df2.copy()#reset the altered df back to the original
                Up = dif > gap
                Down = dif < -gap

                df.loc[Up,'predict'] = 'Up'
                df.loc[Down,'predict'] = 'Down'

                df_result = df.dropna()
                Total = df.shape[0]
                count = df_result.shape[0]
                ratio = count/Total
                print(f'Total: {Total}; count: {count}; ratio: {ratio}')






                share|improve this answer












                share|improve this answer



                share|improve this answer










                answered Nov 25 '18 at 9:20









                mathguymathguy

                1197




                1197
































                    draft saved

                    draft discarded




















































                    Thanks for contributing an answer to Stack Overflow!


                    • Please be sure to answer the question. Provide details and share your research!

                    But avoid



                    • Asking for help, clarification, or responding to other answers.

                    • Making statements based on opinion; back them up with references or personal experience.


                    To learn more, see our tips on writing great answers.




                    draft saved


                    draft discarded














                    StackExchange.ready(
                    function () {
                    StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fstackoverflow.com%2fquestions%2f53466086%2flooping-over-pandas-dataframe%23new-answer', 'question_page');
                    }
                    );

                    Post as a guest















                    Required, but never shown





















































                    Required, but never shown














                    Required, but never shown












                    Required, but never shown







                    Required, but never shown

































                    Required, but never shown














                    Required, but never shown












                    Required, but never shown







                    Required, but never shown







                    這個網誌中的熱門文章

                    Hercules Kyvelos

                    Tangent Lines Diagram Along Smooth Curve

                    Yusuf al-Mu'taman ibn Hud