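// p6, m4, rawrep and pages are declared elsewhere in the surrounding code.
// Match each unwanted character; the "$" alternative also matches the empty string at end of input.
p6 = Pattern.compile("$| |—|​|—|’| ");
StringBuffer modified = new StringBuffer();
m4 = p6.matcher(rawrep);
// Replace every match with a single space.
while (m4.find()) {
    m4.appendReplacement(modified, " ");
}
m4.appendTail(modified);
rawrep = modified.toString();
rawrep = rawrep.toLowerCase();
// Split the cleaned text into pages on the literal marker "page-break".
pages = rawrep.split("page-break");
System.out.println("found pages " + pages.length);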
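
The same clean-then-split flow can be sketched as a self-contained program. This is only an illustration under stated assumptions: the class name PageSplitter, the method splitPages, and the sample input are hypothetical. The character set is also a guess at what the pattern above is meant to cover (non-breaking space, em dash, zero-width space, right single quotation mark). It swaps the find/appendReplacement loop for Matcher.replaceAll, which does the same job for a fixed replacement, and uses toLowerCase(Locale.ROOT) so the result does not depend on the default locale.

```java
import java.util.Locale;
import java.util.regex.Pattern;

class PageSplitter {
    // Assumed character set (not confirmed by the snippet above):
    // U+00A0 non-breaking space, U+2014 em dash,
    // U+200B zero-width space, U+2019 right single quotation mark.
    private static final Pattern NOISE =
            Pattern.compile("[\\u00A0\\u2014\\u200B\\u2019]");

    // Hypothetical helper: replace each noise character with a space,
    // lowercase the text, then split on the literal marker "page-break".
    static String[] splitPages(String raw) {
        String cleaned = NOISE.matcher(raw).replaceAll(" ")
                .toLowerCase(Locale.ROOT);
        // String.split treats its argument as a regex; "page-break"
        // contains no metacharacters, so it matches literally.
        return cleaned.split("page-break");
    }

    public static void main(String[] args) {
        String[] pages =
                splitPages("First\u00A0page PAGE-BREAK second\u2014page");
        System.out.println("found pages " + pages.length); // found pages 2
    }
}
```

Because the text is lowercased before splitting, the marker is matched regardless of its original case. Note that String.split drops trailing empty strings, so a marker at the very end of the input does not add an extra page to the count.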