This is a detailed reproduction of ref.

Sunny Summary

3 steps:

preprocessing.py: preprocessing that extracts the author, average sentence length, average word length, punctuation profile, sentiment scores, and part-of-speech profiles/tags (computed only in code, not written to the CSV).

TFIDF.py: content-wise k-means......
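
As an illustration of the kind of features the preprocessing step extracts, here is a minimal sketch using only the Python standard library. It is not the code in preprocessing.py: the function name `stylometric_features`, the regex-based sentence and word splitting, and the per-character normalisation of the punctuation profile are all assumptions made for this example. Sentiment scores and part-of-speech tags are left out because they need an external NLP library.

```python
import re
import string
from collections import Counter


def stylometric_features(text: str) -> dict:
    """Compute a few simple style features for one document (hypothetical helper)."""
    # Naive sentence split: break on whitespace that follows '.', '!' or '?'.
    sentences = [s for s in re.split(r"(?<=[.!?])\s+", text.strip()) if s]
    # Words are runs of ASCII letters and apostrophes (an assumption of this sketch).
    words = re.findall(r"[A-Za-z']+", text)
    # Count each punctuation character that occurs in the text.
    punct = Counter(ch for ch in text if ch in string.punctuation)
    n_chars = max(len(text), 1)  # avoid division by zero on empty input

    return {
        # Average number of words per sentence.
        "avg_sentence_length": len(words) / len(sentences) if sentences else 0.0,
        # Average number of characters per word.
        "avg_word_length": sum(len(w) for w in words) / len(words) if words else 0.0,
        # Frequency of each punctuation mark relative to the text length.
        "punctuation_profile": {p: c / n_chars for p, c in punct.items()},
    }


if __name__ == "__main__":
    print(stylometric_features("Hello, world! This is a test."))
```

Each document would yield one such dictionary, which could then be flattened into one CSV row per text alongside the author label.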