eric_han

CleanLinks Strings

Aug 26th, 2016 (edited)
222
0
Never
Not a member of Pastebin yet? Sign Up, it unlocks many cool features!
text 1.74 KB | None | 0 0
  1. Skip Links matching with:
  2.  
  3. ServiceLogin|imgres\?|watch%3Fv|auth\?client_id|signup|bing\.com/widget|oauth|openid\.ns|\.mcstatic\.com|sVidLoc|[Ll]ogout|submit\?url=|magnet:|(?:# Start TT strings to allow URLs. Since JavaScript regex is sadly lacking a comment syntax, fake one. This "comment" tells regex parser to look for a (string in a URL) that starts with a '#'. Then a space. Then the rest of this comment. The combination of which I am pretty sure the parser will not find.)|^abp:|\bbing.com/account\b|\bgoogle.com/(?:(?:set)?pref(?:erence)?s?|recaptcha)\b|^moz-extension://\b|\bonlinevideoconverter.com/download\?file=
  4.  
  5. Remove From Links:
  6.  
  7. (?:ref|aff)\w*|utm_\w+|(?:merchant|programme|media)ID|(?:# Start TT keywords to remove. Since JavaScript regex is sadly lacking a comment syntax, fake one. This "comment" tells parser to look for a (keyword in a URL) that starts with a '#'. Then a space. Then the rest of this comment. Then a '='. The combination of which I am pretty sure the parser will not find.)|li|(?:ref(?:er(?:r?er)?)?)\w*
  8.  
  9. Skip Domains:
  10.  
  11. accounts.google.com,docs.google.com,translate.google.com,login.live.com,plus.google.com,www.facebook.com,twitter.com,static.ak.facebook.com,www.linkedin.com,www.virustotal.com,account.live.com,admin.brightcove.com,www.mywot.com,webcache.googleusercontent.com,web.archive.org,accounts.youtube.com,signin.ebay.com,(?:# Start TT domains. For consistency,use bogus JavaScript regex comment syntax. Domain field is not a regex. So this "comment" tells parser to look for a domain that starts with a left parenthesis. Which I am pretty sure the parser will not find. Same for whatever comes after commas within this comment.),security.google.com,translate.googleusercontent.com,ixquick-proxy.com,phish-error.mozilla.com
Add Comment
Please, Sign In to add comment