hxrussia

Robust Regexp for Profanity Detection in Russian Texts

Jul 21st, 2017
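import re

# Verbose (x), case-insensitive (i) pattern; each top-level alternative covers one root.
expr = re.compile(r'''(?ix)
    # lookbehinds skip innocent words such as "потреблять" and "оскорблять"
    (?<!подо)(?<!потре)(?<!оскор)
        (бля[дт]|\bбля\b)|
    # lookbehind skips innocent words such as "плохую"
    (?<!пло)
        ху[йеияюё]|
    п[иё]зд|
    # optional prefix followed by the root, anchored at a word boundary
    (?<!PHP-)
        \b(|вы|взъ|долбо|за|мозго|мудо|на|невзъ|невъ|недо|до|
        объ|отъ|подза|пере|подъ|по|при|про|раз|съ|у)[её]б
''')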
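
A minimal usage sketch, assuming the pattern is meant to be applied with `search` over arbitrary text. The names `PROFANITY` and `contains_profanity` are hypothetical and not part of the original paste; the pattern is repeated here only so the example runs on its own.

```python
import re

# Same pattern as in the paste, reproduced so this sketch is self-contained.
PROFANITY = re.compile(r'''(?ix)
    (?<!подо)(?<!потре)(?<!оскор)
        (бля[дт]|\bбля\b)|
    (?<!пло)
        ху[йеияюё]|
    п[иё]зд|
    (?<!PHP-)
        \b(|вы|взъ|долбо|за|мозго|мудо|на|невзъ|невъ|недо|до|
        объ|отъ|подза|пере|подъ|по|при|про|раз|съ|у)[её]б
''')


def contains_profanity(text: str) -> bool:
    """Return True if any alternative of the pattern matches anywhere in text."""
    return PROFANITY.search(text) is not None


# Innocent words that contain a flagged substring but should not be reported:
# the first three are excluded by negative lookbehinds, the last by the \b anchor.
for word in ["потреблять", "оскорблять", "плохую", "хлеб"]:
    print(word, contains_profanity(word))
```

Each of these four words should print `False`. Using `search` rather than `match` matters here, because the flagged roots can occur anywhere in the text, not only at the start.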