Profanity Study
Useful stuff
Overall ideas:
- Profanity detection web service ? - fast
-
Profanity test tool - generate test data (see leetspeak related)
-
https://stackoverflow.com/questions/273516/how-do-you-implement-a-good-profanity-filter
- https://stackoverflow.com/a/273520/7468990
- http://habitatchronicles.com/2007/03/the-untold-history-of-toontowns-speedchat-or-blockchattm-from-disney-finally-arrives/
- https://blog.codinghorror.com/obscenity-filters-bad-idea-or-incredibly-intercoursing-bad-idea/
Business alternatives
Opensource Lists for profanity detection service
- https://github.com/topics/profanity
- https://github.com/google/re2
- https://github.com/RobertJGabriel/Google-profanity-words/blob/master/list.txt
- https://github.com/chucknorris-io/swear-words/blob/master/en
- https://github.com/words/cuss/blob/master/index.json
- https://github.com/MauriceButler/badwords
Leetspeak related
- https://github.com/snipe/banbuilder
- https://github.com/snipe/banbuilder/blob/master/src/CensorWords.php#L183
Checking regex for safety
- https://github.com/substack/safe-regex - check if regex is dangerous
- http://www.cs.bham.ac.uk/~hxt/research/rxxr2/ - regex static analyser
Other random resources
Old regex test tool - http://www.ultrapico.com/Expresso.htm