# clustersort TODO: - calculate 'sums' by dot product of X_tfidf with the vector of the lengths of the document collection in X