Advertisement
maarten1012

HTML to Text

Apr 17th, 2014
48
0
Never
Not a member of Pastebin yet? Sign Up, it unlocks many cool features!
PHP 2.43 KB | None | 0 0
  1. <?php
  2. $descriptorspec = array(
  3.    0 => array("pipe", "r"),  // stdin is a pipe that the child will read from
  4.    1 => array("pipe", "w"),  // stdout is a pipe that the child will write to
  5.    2 => array("file", "/tmp/htmp2txt.log", "a") // stderr is a file to write to
  6. );
  7.  
  8. $process = proc_open('lynx -stdin -dump 2>&1', $descriptorspec, $pipes, '/tmp', NULL);
  9.  
  10. if (is_resource($process)) {
  11.     // $pipes now looks like this:
  12.     // 0 => writeable handle connected to child stdin
  13.     // 1 => readable handle connected to child stdout
  14.     // Any error output will be appended to htmp2txt.log
  15.  
  16.     $stdin = $pipes[0];
  17.     fwrite($stdin,  <<<'EOT'
  18. <html xmlns="http://www.w3.org/1999/xhtml" xml:lang="en" lang="en">
  19. <head>
  20.  <title>TEST</title>
  21. </head>
  22. <body>
  23. <h1><span>Lorem Ipsum</span></h1>
  24.  
  25. <h4>"Neque porro quisquam est qui dolorem ipsum quia dolor sit amet, consectetur, adipisci velit..."</h4>
  26. <h5>"There is no one who loves pain itself, who seeks after it and wants to have it, simply because it is pain..."</h5>
  27. <p>
  28. Lorem ipsum dolor sit amet, consectetur adipiscing elit. Pellentesque et sapien ut erat porttitor suscipit id nec dui. Nam rhoncus mauris ac dui tristique bibendum. Aliquam molestie placerat gravida. Duis vitae tortor gravida libero semper cursus eu ut tortor. Nunc id orci orci. Suspendisse potenti. Phasellus vehicula leo sed erat rutrum sed blandit purus convallis.
  29. </p>
  30. <p>
  31. Aliquam feugiat, neque a tempus rhoncus, neque dolor vulputate eros, non pellentesque elit lacus ut nunc. Pellentesque vel purus libero, ultrices condimentum lorem. Nam dictum faucibus mollis. Praesent adipiscing nunc sed dui ultricies molestie. Quisque facilisis purus quis felis molestie ut accumsan felis ultricies. Curabitur euismod est id est pretium accumsan. Praesent a mi in dolor feugiat vehicula quis at elit. Mauris lacus mauris, laoreet non molestie nec, adipiscing a nulla. Nullam rutrum, libero id pellentesque tempus, erat nibh ornare dolor, id accumsan est risus at leo. In convallis felis at eros condimentum adipiscing aliquam nisi faucibus. Integer arcu ligula, porttitor in fermentum vitae, lacinia nec dui.
  32. </p>
  33. </body>
  34. </html>
  35. EOT
  36.     );
  37.     fclose($stdin);
  38.  
  39.     echo stream_get_contents($pipes[1]);
  40.     fclose($pipes[1]);
  41.  
  42.     // It is important that you close any pipes before calling
  43.     // proc_close in order to avoid a deadlock
  44.     $return_value = proc_close($process);
  45.  
  46.     echo "command returned $return_value\n";
  47. }
Advertisement
Add Comment
Please, Sign In to add comment
Advertisement