Advertisement
Not a member of Pastebin yet?
Sign Up,
it unlocks many cool features!
- with open('threenewsarticles.txt', 'r', encoding='utf8') as my_file:
- rawData = my_file.read()
- print(rawData)
- #Separating body text from metadata. This code only works if the textfile has one article.
- articleStart = rawData.find("<div class=\"story-element story-element-text\">")
- articlemetaData = rawData[:articleStart]
- articleBody = rawData[articleStart:]
- print(articlemetaData)
- print("*******")
- print(articleBody)
- print("*******")
Advertisement
Add Comment
Please, Sign In to add comment
Advertisement