Spark Streaming textFileStream not picking new files


I am trying to run a Spark Streaming word count application using spark-submit. After starting the program, I copy new files into the monitored local folder, but the files are not recognized and nothing is processed. The log shows the following warning:



19/03/03 17:22:17 INFO handler.ContextHandler: Started o.s.j.s.ServletContextHandler@87b5b49/static/streaming,null,AVAILABLE,@Spark
19/03/03 17:22:17 INFO streaming.StreamingContext: StreamingContext started
19/03/03 17:22:21 WARN dstream.FileInputDStream: Error finding new files java.lang.NullPointerException



I submit the Spark job with the following command:



spark-submit --class com.company.stream.WordCounter --master local[4] /home/workspace/spark/SparkWordCounter.jar



The source code is below:



val lines = ssc.textFileStream("/home/workspace/spark/data")
val words = lines.flatMap(line => line.split(","))
val pairs = words.map(word => (word, 1))
val wordCount = pairs.reduceByKey(_ + _)
wordCount.print()
ssc.start()
ssc.awaitTermination()
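
The snippet above does not show how `ssc` is created. For reference, a minimal self-contained sketch of the whole application might look like the following. The application name and the 10-second batch interval are assumptions for illustration and are not taken from the original job; the master is left to `spark-submit` (`--master local[4]`).

```scala
package com.company.stream

import org.apache.spark.SparkConf
import org.apache.spark.streaming.{Seconds, StreamingContext}

object WordCounter {
  def main(args: Array[String]): Unit = {
    // App name and batch interval are assumed values, not from the original job.
    val conf = new SparkConf().setAppName("WordCounter")
    val ssc = new StreamingContext(conf, Seconds(10))

    // Directory monitored for newly created files (path from the question).
    val lines = ssc.textFileStream("/home/workspace/spark/data")
    val words = lines.flatMap(line => line.split(","))
    val pairs = words.map(word => (word, 1))
    val wordCount = pairs.reduceByKey(_ + _)
    wordCount.print()

    ssc.start()
    ssc.awaitTermination()
  }
}
```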


Interestingly, it works seamlessly on Windows: the file I place in the folder is picked up and the program works fine.



Also, if I poll the HDFS folder path "hdfs:///home/workspace/data", the program picks up the file and reads it correctly. The problem occurs only with a local folder path on CentOS.
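
When reproducing this, how the file arrives in the monitored folder may matter: the Spark Streaming programming guide says files should be created in the monitored directory by atomically moving or renaming them into it. Below is a minimal sketch of doing that from Scala, assuming a staging directory on the same filesystem. `AtomicDrop`, `stageAndMove`, and the staging directory are hypothetical names introduced for illustration, and nothing here is confirmed to be the cause of the NullPointerException above.

```scala
import java.nio.charset.StandardCharsets
import java.nio.file.{Files, Path, Paths, StandardCopyOption}

object AtomicDrop {
  // Hypothetical helper: write the file in a staging directory first, then
  // move it into the monitored directory with a single atomic rename.
  def stageAndMove(content: String, stagingDir: Path, monitoredDir: Path, name: String): Path = {
    Files.createDirectories(stagingDir)
    Files.createDirectories(monitoredDir)
    val staged = stagingDir.resolve(name)
    Files.write(staged, content.getBytes(StandardCharsets.UTF_8))
    // ATOMIC_MOVE throws instead of silently falling back to copy-and-delete,
    // for example when the two directories are on different filesystems.
    Files.move(staged, monitoredDir.resolve(name), StandardCopyOption.ATOMIC_MOVE)
  }

  def main(args: Array[String]): Unit = {
    if (args.length != 2) {
      println("usage: AtomicDrop <stagingDir> <monitoredDir>")
    } else {
      val moved = stageAndMove("a,b,a\n", Paths.get(args(0)), Paths.get(args(1)), "words-1.txt")
      println(s"Moved to $moved")
    }
  }
}
```

For the job in this question, the second argument would be `/home/workspace/spark/data`; the first can be any writable directory on the same filesystem.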










Tags: scala, apache-spark, spark-streaming






asked Mar 7 at 11:58 by Saravanan Ponnaiah; edited Mar 8 at 10:49 by Saravanan Ponnaiah





















