Text preprocessing for unsupervised learning: Why it matters, when it misleads, and what to do about it (Political Analysis, 2018)
Bag of what? Simple noun phrase extraction for text analysis (EMNLP, NLP + Computational Social Science, 2016)