Advertisement
brandizzi

Find last of a series of linking tweets

Feb 14th, 2014
430
0
Never
Not a member of Pastebin yet? Sign Up, it unlocks many cool features!
Python 0.57 KB | None | 0 0
  1. import sys
  2. from lxml.etree import HTMLParser, fromstring
  3. from lxml.cssselect import CSSSelector
  4. import requests
  5.  
  6. url = sys.argv[1]
  7. counter = 1
  8.  
  9. while True:
  10.     tweet = requests.get(url)
  11.     doc = fromstring(tweet.content, HTMLParser())
  12.  
  13.     contents_selector = CSSSelector('.tweet-text')
  14.     contents = contents_selector(doc)
  15.     if not contents:
  16.             break
  17.  
  18.     text = contents[0].text
  19.     url_selector = CSSSelector('.twitter-timeline-link')
  20.     url = url_selector(doc)[0].attrib['title']
  21.  
  22.     print("%d: %s %s" % (counter, text, url))
  23.     counter += 1
Advertisement
Add Comment
Please, Sign In to add comment
Advertisement