Don't like ads? PRO users don't see any ads ;-)
Guest

Untitled

By: a guest on May 27th, 2012  |  syntax: None  |  size: 1.00 KB  |  hits: 27  |  expires: Never
download  |  raw  |  embed  |  report abuse  |  print
Text below is selected. Please press Ctrl+C to copy to your clipboard. (⌘+C on Mac)
  1. Parsing contents of paragraph elements with Nokogiri
  2. <p>
  3.   <font size="5" face="Arial, Helvetica, sans-serif" color="#00CCAA" class="">
  4.     <font color="#AAFF33" class="">
  5.       October 10, 1990 - Maybe a Title
  6.     </font>-
  7.     <font size="4" class="">
  8.       Some long text here.        
  9.       <font color="#66CC00" class="">
  10.         <a href="SourceTitle/date.pdf">[Blah Blah, October 27, 1982 p. 2</a>
  11.         ]
  12.       </font>.
  13.       More content.
  14.       <font color="#00FF33" class="">[Another Source, 1971, issue 01/4]
  15.       </font>.
  16.     </font>
  17.     <font size="5" face="Arial, Helvetica, sans-serif" color="#00CCAA" class="">
  18.       <font color="#AAFF33" class=""><font size="4" color="#00CCAA" class="">
  19.         Another fantastic article.
  20.         <a href="SourceTitle/Date.pdf">[Some Source, October 4, p.6]</a>
  21.       </font>
  22.     </font>
  23.   </font>
  24. </font>
  25. </p>
  26.        
  27. >> doc.xpath('//p').each do |node|
  28. ..     puts node.xpath("font[@size='5']/font").first.content.strip
  29. ..   end #=> 0
  30. October 10, 1990 - Maybe a Title