How to parse xml using scrapy The Next CEO of Stack OverflowHow to merge two dictionaries in a single expression?How do I check if a list is empty?How do I check whether a file exists without exceptions?How can I safely create a nested directory in Python?How do I parse a string to a float or int in Python?How do I sort a dictionary by value?How to make a chain of function decorators?How do I parse XML in Python?How do I list all files of a directory?How Do You Parse and Process HTML/XML in PHP?

Can we say or write : "No, it'sn't"?

Is it convenient to ask the journal's editor for two additional days to complete a review?

Example of a Mathematician/Physicist whose Other Publications during their PhD eclipsed their PhD Thesis

Is it okay to majorly distort historical facts while writing a fiction story?

Is the D&D universe the same as the Forgotten Realms universe?

Easy to read palindrome checker

Can MTA send mail via a relay without being told so?

What connection does MS Office have to Netscape Navigator?

Yu-Gi-Oh cards in Python 3

What is meant by "large scale tonal organization?"

Is it ever safe to open a suspicious HTML file (e.g. email attachment)?

What happened in Rome, when the western empire "fell"?

How do I align (1) and (2)?

Make solar eclipses exceedingly rare, but still have new moons

Does increasing your ability score affect your main stat?

Is it my responsibility to learn a new technology in my own time my employer wants to implement?

0-rank tensor vs vector in 1D

How to check if all elements of 1 list are in the *same quantity* and in any order, in the list2?

Do I need to write [sic] when a number is less than 10 but isn't written out?

Won the lottery - how do I keep the money?

Are police here, aren't itthey?

Why specifically branches as firewood on the Altar?

Defamation due to breach of confidentiality

Is it professional to write unrelated content in an almost-empty email?



How to parse xml using scrapy



The Next CEO of Stack OverflowHow to merge two dictionaries in a single expression?How do I check if a list is empty?How do I check whether a file exists without exceptions?How can I safely create a nested directory in Python?How do I parse a string to a float or int in Python?How do I sort a dictionary by value?How to make a chain of function decorators?How do I parse XML in Python?How do I list all files of a directory?How Do You Parse and Process HTML/XML in PHP?










1















How to scrape the XML using scrapy.



My XML looks something like this:



 <rss xmlns:media="http://search.yahoo.com/mrss/" version="2.0">
<channel>
<generator>NFE/5.0</generator>
<title>"python" - Google News</title>
<link>
https://news.google.com/search?q=python&hl=en-IN&gl=IN&ceid=IN:en
</link>
<language>en-IN</language>
<webMaster>news-webmaster@google.com</webMaster>
<copyright>2019 Google Inc.</copyright>
<lastBuildDate>Thu, 07 Mar 2019 16:48:55 GMT</lastBuildDate>
<description>Google News</description>
<item>
<title>
Brown snake attacks python eating a rat - NEWS.com.au
</title>
</channel>
</rss>


My code looks like this:



from scrapy.spiders import XMLFeedSpider
from scrapy.http import HtmlResponse
from scrapy.selector import Selector


response = HtmlResponse(url='https://news.google.com/rss/search?q=python&hl=en-IN&gl=IN&ceid=IN:en')
xxs = Selector(response)
obj = xxs.xpath('//title/text()').extract()


I want to get the text in the title tag. But here I'm getting an empty list. Please help me out. It's important.
Thanks a lot










share|improve this question


























    1















    How to scrape the XML using scrapy.



    My XML looks something like this:



     <rss xmlns:media="http://search.yahoo.com/mrss/" version="2.0">
    <channel>
    <generator>NFE/5.0</generator>
    <title>"python" - Google News</title>
    <link>
    https://news.google.com/search?q=python&hl=en-IN&gl=IN&ceid=IN:en
    </link>
    <language>en-IN</language>
    <webMaster>news-webmaster@google.com</webMaster>
    <copyright>2019 Google Inc.</copyright>
    <lastBuildDate>Thu, 07 Mar 2019 16:48:55 GMT</lastBuildDate>
    <description>Google News</description>
    <item>
    <title>
    Brown snake attacks python eating a rat - NEWS.com.au
    </title>
    </channel>
    </rss>


    My code looks like this:



    from scrapy.spiders import XMLFeedSpider
    from scrapy.http import HtmlResponse
    from scrapy.selector import Selector


    response = HtmlResponse(url='https://news.google.com/rss/search?q=python&hl=en-IN&gl=IN&ceid=IN:en')
    xxs = Selector(response)
    obj = xxs.xpath('//title/text()').extract()


    I want to get the text in the title tag. But here I'm getting an empty list. Please help me out. It's important.
    Thanks a lot










    share|improve this question
























      1












      1








      1








      How to scrape the XML using scrapy.



      My XML looks something like this:



       <rss xmlns:media="http://search.yahoo.com/mrss/" version="2.0">
      <channel>
      <generator>NFE/5.0</generator>
      <title>"python" - Google News</title>
      <link>
      https://news.google.com/search?q=python&hl=en-IN&gl=IN&ceid=IN:en
      </link>
      <language>en-IN</language>
      <webMaster>news-webmaster@google.com</webMaster>
      <copyright>2019 Google Inc.</copyright>
      <lastBuildDate>Thu, 07 Mar 2019 16:48:55 GMT</lastBuildDate>
      <description>Google News</description>
      <item>
      <title>
      Brown snake attacks python eating a rat - NEWS.com.au
      </title>
      </channel>
      </rss>


      My code looks like this:



      from scrapy.spiders import XMLFeedSpider
      from scrapy.http import HtmlResponse
      from scrapy.selector import Selector


      response = HtmlResponse(url='https://news.google.com/rss/search?q=python&hl=en-IN&gl=IN&ceid=IN:en')
      xxs = Selector(response)
      obj = xxs.xpath('//title/text()').extract()


      I want to get the text in the title tag. But here I'm getting an empty list. Please help me out. It's important.
      Thanks a lot










      share|improve this question














      How to scrape the XML using scrapy.



      My XML looks something like this:



       <rss xmlns:media="http://search.yahoo.com/mrss/" version="2.0">
      <channel>
      <generator>NFE/5.0</generator>
      <title>"python" - Google News</title>
      <link>
      https://news.google.com/search?q=python&hl=en-IN&gl=IN&ceid=IN:en
      </link>
      <language>en-IN</language>
      <webMaster>news-webmaster@google.com</webMaster>
      <copyright>2019 Google Inc.</copyright>
      <lastBuildDate>Thu, 07 Mar 2019 16:48:55 GMT</lastBuildDate>
      <description>Google News</description>
      <item>
      <title>
      Brown snake attacks python eating a rat - NEWS.com.au
      </title>
      </channel>
      </rss>


      My code looks like this:



      from scrapy.spiders import XMLFeedSpider
      from scrapy.http import HtmlResponse
      from scrapy.selector import Selector


      response = HtmlResponse(url='https://news.google.com/rss/search?q=python&hl=en-IN&gl=IN&ceid=IN:en')
      xxs = Selector(response)
      obj = xxs.xpath('//title/text()').extract()


      I want to get the text in the title tag. But here I'm getting an empty list. Please help me out. It's important.
      Thanks a lot







      python xml web-scraping scrapy






      share|improve this question













      share|improve this question











      share|improve this question




      share|improve this question










      asked Mar 7 at 16:58









      A. AnandA. Anand

      61




      61






















          1 Answer
          1






          active

          oldest

          votes


















          0














          You are getting forbidden by robots.txt.
          You need to change this behavior in the settings.py and change ROBOTSTXT_OBEY=Trueto ROBOTSTXT_OBEY=False.






          share|improve this answer























          • It's still not working

            – A. Anand
            Mar 8 at 4:23











          Your Answer






          StackExchange.ifUsing("editor", function ()
          StackExchange.using("externalEditor", function ()
          StackExchange.using("snippets", function ()
          StackExchange.snippets.init();
          );
          );
          , "code-snippets");

          StackExchange.ready(function()
          var channelOptions =
          tags: "".split(" "),
          id: "1"
          ;
          initTagRenderer("".split(" "), "".split(" "), channelOptions);

          StackExchange.using("externalEditor", function()
          // Have to fire editor after snippets, if snippets enabled
          if (StackExchange.settings.snippets.snippetsEnabled)
          StackExchange.using("snippets", function()
          createEditor();
          );

          else
          createEditor();

          );

          function createEditor()
          StackExchange.prepareEditor(
          heartbeatType: 'answer',
          autoActivateHeartbeat: false,
          convertImagesToLinks: true,
          noModals: true,
          showLowRepImageUploadWarning: true,
          reputationToPostImages: 10,
          bindNavPrevention: true,
          postfix: "",
          imageUploader:
          brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
          contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
          allowUrls: true
          ,
          onDemand: true,
          discardSelector: ".discard-answer"
          ,immediatelyShowMarkdownHelp:true
          );



          );













          draft saved

          draft discarded


















          StackExchange.ready(
          function ()
          StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fstackoverflow.com%2fquestions%2f55049168%2fhow-to-parse-xml-using-scrapy%23new-answer', 'question_page');

          );

          Post as a guest















          Required, but never shown

























          1 Answer
          1






          active

          oldest

          votes








          1 Answer
          1






          active

          oldest

          votes









          active

          oldest

          votes






          active

          oldest

          votes









          0














          You are getting forbidden by robots.txt.
          You need to change this behavior in the settings.py and change ROBOTSTXT_OBEY=Trueto ROBOTSTXT_OBEY=False.






          share|improve this answer























          • It's still not working

            – A. Anand
            Mar 8 at 4:23















          0














          You are getting forbidden by robots.txt.
          You need to change this behavior in the settings.py and change ROBOTSTXT_OBEY=Trueto ROBOTSTXT_OBEY=False.






          share|improve this answer























          • It's still not working

            – A. Anand
            Mar 8 at 4:23













          0












          0








          0







          You are getting forbidden by robots.txt.
          You need to change this behavior in the settings.py and change ROBOTSTXT_OBEY=Trueto ROBOTSTXT_OBEY=False.






          share|improve this answer













          You are getting forbidden by robots.txt.
          You need to change this behavior in the settings.py and change ROBOTSTXT_OBEY=Trueto ROBOTSTXT_OBEY=False.







          share|improve this answer












          share|improve this answer



          share|improve this answer










          answered Mar 7 at 17:35









          H. DucatiH. Ducati

          31




          31












          • It's still not working

            – A. Anand
            Mar 8 at 4:23

















          • It's still not working

            – A. Anand
            Mar 8 at 4:23
















          It's still not working

          – A. Anand
          Mar 8 at 4:23





          It's still not working

          – A. Anand
          Mar 8 at 4:23



















          draft saved

          draft discarded
















































          Thanks for contributing an answer to Stack Overflow!


          • Please be sure to answer the question. Provide details and share your research!

          But avoid


          • Asking for help, clarification, or responding to other answers.

          • Making statements based on opinion; back them up with references or personal experience.

          To learn more, see our tips on writing great answers.




          draft saved


          draft discarded














          StackExchange.ready(
          function ()
          StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fstackoverflow.com%2fquestions%2f55049168%2fhow-to-parse-xml-using-scrapy%23new-answer', 'question_page');

          );

          Post as a guest















          Required, but never shown





















































          Required, but never shown














          Required, but never shown












          Required, but never shown







          Required, but never shown

































          Required, but never shown














          Required, but never shown












          Required, but never shown







          Required, but never shown







          Popular posts from this blog

          Save data to MySQL database using ExtJS and PHP [closed]2019 Community Moderator ElectionHow can I prevent SQL injection in PHP?Which MySQL data type to use for storing boolean valuesPHP: Delete an element from an arrayHow do I connect to a MySQL Database in Python?Should I use the datetime or timestamp data type in MySQL?How to get a list of MySQL user accountsHow Do You Parse and Process HTML/XML in PHP?Reference — What does this symbol mean in PHP?How does PHP 'foreach' actually work?Why shouldn't I use mysql_* functions in PHP?

          Compiling GNU Global with universal-ctags support Announcing the arrival of Valued Associate #679: Cesar Manara Planned maintenance scheduled April 23, 2019 at 23:30 UTC (7:30pm US/Eastern) Data science time! April 2019 and salary with experience The Ask Question Wizard is Live!Tags for Emacs: Relationship between etags, ebrowse, cscope, GNU Global and exuberant ctagsVim and Ctags tips and trickscscope or ctags why choose one over the other?scons and ctagsctags cannot open option file “.ctags”Adding tag scopes in universal-ctagsShould I use Universal-ctags?Universal ctags on WindowsHow do I install GNU Global with universal ctags support using Homebrew?Universal ctags with emacsHow to highlight ctags generated by Universal Ctags in Vim?

          Add ONERROR event to image from jsp tldHow to add an image to a JPanel?Saving image from PHP URLHTML img scalingCheck if an image is loaded (no errors) with jQueryHow to force an <img> to take up width, even if the image is not loadedHow do I populate hidden form field with a value set in Spring ControllerStyling Raw elements Generated from JSP tagds with Jquery MobileLimit resizing of images with explicitly set width and height attributeserror TLD use in a jsp fileJsp tld files cannot be resolved