A guiding principle in data science and in machine learning. If our data is poor quality, we end up having poor quality results.

Stop words: we remove them in NLP analysis because they don’t provide anything new for us.