NLP lexical resources

WordNet

Resource: https://wordnet.princeton.edu/
WordNet® is a large lexical database of English. Nouns, verbs, adjectives and adverbs are grouped into sets of cognitive synonyms (synsets), each expressing a distinct concept. Synsets are interlinked by means of conceptual-semantic and lexical relations. The resulting network of meaningfully related words and concepts can be navigated with the browser. WordNet is also freely and publicly available for download. WordNet’s structure makes it a useful tool for computational linguistics and natural language processing.
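
To make the synset idea concrete, here is a minimal sketch of a synset graph and how it can be navigated. It is not the WordNet API or its data: the `Synset` class, the `hypernym_path` helper, and the three toy entries are hypothetical and written by hand for illustration. A hypernym is the "is a kind of" relation, one of the conceptual-semantic links mentioned above.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class Synset:
    """A set of synonymous lemmas that together express one concept."""
    name: str
    lemmas: List[str]
    gloss: str
    hypernyms: List["Synset"] = field(default_factory=list)


def hypernym_path(synset):
    """Follow the first hypernym link upward until a root is reached."""
    path = [synset.name]
    while synset.hypernyms:
        synset = synset.hypernyms[0]
        path.append(synset.name)
    return path


# Hand-made toy entries (assumption: NOT copied from the WordNet database).
animal = Synset("animal", ["animal", "creature", "beast"], "a living organism that can move")
canine = Synset("canine", ["canine", "canid"], "a member of the dog family", [animal])
dog = Synset("dog", ["dog", "domestic dog"], "a domesticated canine", [canine])

# Index from lemma to the synsets containing it; a word may have several senses.
index = {}
for s in (animal, canine, dog):
    for lemma in s.lemmas:
        index.setdefault(lemma, []).append(s)

print(hypernym_path(dog))                 # walk from specific to general
print([s.name for s in index["canid"]])   # look up a word's synsets
```

The real database has the same shape at a much larger scale: a lookup from a word to its synsets, and typed links between synsets that can be followed in either direction.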

Glitch Text Generator

Resource: https://glyphy.io/font-generator/glitch-text
Use our glitch text generator to design creepy text for your social media accounts. Copy and paste these cursed fonts to add some weirdness to your profiles!
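
Glitch ("Zalgo") text of this kind is commonly produced by stacking Unicode combining marks on ordinary characters. The sketch below shows that general technique; whether the generator linked above works this way is an assumption, and the `glitch` and `unglitch` functions and their parameters are hypothetical names chosen for illustration.

```python
import random

# Combining Diacritical Marks block (U+0300 to U+036F).
COMBINING_MARKS = [chr(cp) for cp in range(0x0300, 0x0370)]
_MARK_SET = set(COMBINING_MARKS)


def glitch(text, marks_per_char=3, seed=0):
    """Append random combining marks after every non-space character."""
    rng = random.Random(seed)
    out = []
    for ch in text:
        out.append(ch)
        if not ch.isspace():
            out.extend(rng.choice(COMBINING_MARKS) for _ in range(marks_per_char))
    return "".join(out)


def unglitch(text):
    """Strip the marks added by glitch() to recover the base characters."""
    return "".join(ch for ch in text if ch not in _MARK_SET)


noisy = glitch("hello world")
print(noisy)
print(unglitch(noisy))
```

For NLP work the inverse direction is often the more useful one: stripping such marks is a cheap normalization step before tokenization.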

Corrupted-Text Python Library

A Python library for generating out-of-distribution text datasets. Specifically, the library applies model-independent, commonplace corruptions rather than model-specific, worst-case adversarial corruptions. We thus aim to allow benchmark studies of robustness against realistic outliers.
Install with pip:
pip install corrupted-text
Article: Text Augmentation Using Corrupted-Text Python Library
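
As a rough illustration of what "model-independent, commonplace corruptions" can look like, the following sketch applies random typos to a fraction of the words in a text. It does not use or reproduce the corrupted-text library's API: `corrupt_text`, `corrupt_word`, the `severity` parameter, and the `NEIGHBOURS` table are hypothetical, and the three corruption types (swap, drop, keyboard typo) are assumptions chosen for simplicity.

```python
import random

# Hypothetical, hand-picked keyboard neighbours used to simulate typos.
NEIGHBOURS = {"a": "qs", "e": "wr", "i": "uo", "o": "ip", "s": "ad", "t": "ry"}


def corrupt_word(word, rng):
    """Apply one random character-level corruption to a word."""
    if len(word) < 2:
        return word  # too short to corrupt without emptying it
    op = rng.choice(["swap", "drop", "typo"])
    i = rng.randrange(len(word) - 1)
    if op == "swap":  # transpose two adjacent characters
        return word[:i] + word[i + 1] + word[i] + word[i + 2:]
    if op == "drop":  # delete one character
        return word[:i] + word[i + 1:]
    # typo: replace with a neighbouring key, or leave unchanged if none is known
    replacement = rng.choice(NEIGHBOURS.get(word[i].lower(), word[i]))
    return word[:i] + replacement + word[i + 1:]


def corrupt_text(text, severity=0.3, seed=0):
    """Corrupt roughly a `severity` fraction of whitespace-separated words."""
    if not 0.0 <= severity <= 1.0:
        raise ValueError("severity must be in [0, 1]")
    rng = random.Random(seed)
    words = text.split()
    return " ".join(
        corrupt_word(w, rng) if rng.random() < severity else w for w in words
    )


sample = "the quick brown fox jumps over the lazy dog"
for level in (0.0, 0.5, 1.0):
    print(level, corrupt_text(sample, severity=level, seed=1))
```

Because the corruptions depend only on the text and a seed, not on any model, the same corrupted dataset can be reused to compare the robustness of different models.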

TensorFlow Data Augmentation API

Guide: Text and natural language processing with TensorFlow