Advertisement
rucinski69

convert pdf to unformatted txt

Oct 6th, 2020 (edited)
1,262
0
Never
Not a member of Pastebin yet? Sign Up, it unlocks many cool features!
Bash 0.48 KB | None | 0 0
  1. #!/usr/bin/bash
  2. #create *sh file and move to /usr/bin
  3. if ls *pdf 1> /dev/null 2>&1; then
  4. for i in *pdf;do pdftotext "$i" "$i".txt;done
  5. for i in *pdf.txt;do tr "\n" " " <"$i">"$i".txt;done
  6. for i in *pdf.txt.txt; do sed -e "s/.\{1000\}/&\n\n/g" <"$i">"$i".txt;done
  7. for i in *pdf.txt.txt.txt; do tr '[:upper:]' '[:lower:]' <"$i">"$i".txt;done
  8. rm *pdf.txt *pdf.txt.txt *pdf.txt.txt.txt #*pdf
  9. rename "s/pdf.txt.txt.txt.txt/txt/" *.pdf.txt.txt.txt.txt
  10. else echo "files do not exist"
  11. fi
  12.  
Advertisement
Add Comment
Please, Sign In to add comment
Advertisement