How to use Fuzzy Topic Model as a Classification Model Input

Question

I used fuzzy clustering for topic modelling and got the following result.
There are 50 topics in total (0 to 49), and each topic consists of 30 words, each with a multiplicative probability factor. How do I use this as input to a classifier? My final goal is document classification.

Demo

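# Install the dependencies from a shell first:
#   pip install octis
#   pip install FuzzyTM
from octis.dataset.dataset import Dataset
from FuzzyTM import FLSA_W

# Load the DBLP corpus as a list of tokenized documents
dataset = Dataset()
dataset.fetch_dataset('DBLP')
data = dataset._Dataset__corpus
print(data[0:5])

# Assumed step, missing from the original post: fit FLSA-W with 50 topics of 30 words each
flsaW1 = FLSA_W(input_file=data, num_topics=50, num_words=30)
# pwgt: P(word | topic), ptgd: P(topic | document)
pwgt, ptgd = flsaW1.get_matrices()
topics = flsaW1.show_topics()
print(topics)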

score 0 · Answer 1 · answered Aug 17 '22 at 12:59

Prepare an evaluation dataset of at least 100 documents.
It is important to train with the right data: garbage in means garbage out. Manually verify the result of the topic modelling.
Prepare word vectors from the documents: Gensim's embedding models (e.g. Word2Vec or Doc2Vec) capture context better than CountVectorizer or TF-IDF.
Try Naive Bayes or a neural network and use the most promising model. Decision trees tend not to work well on text classification.
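One way to connect the topic model to the classifier suggested above is to treat each document's topic distribution as its feature vector. The sketch below is only an illustration under assumptions: it uses a small hand-made matrix as a stand-in for ptgd (assumed here to have one row per document and one column per topic, which is worth checking against the real matrix's shape), the labels are hypothetical, and the Gaussian Naive Bayes is a minimal hand-written version so the example runs without extra packages. In practice a library classifier such as scikit-learn's GaussianNB could be fitted on the same matrix.

```python
import math
from collections import defaultdict


def fit_gaussian_nb(X, y, var_floor=1e-6):
    """Estimate a log prior plus per-feature mean and variance for each class."""
    by_class = defaultdict(list)
    for row, label in zip(X, y):
        by_class[label].append(row)
    model = {}
    for label, rows in by_class.items():
        n = len(rows)
        columns = list(zip(*rows))
        means = [sum(col) / n for col in columns]
        # var_floor keeps the variance strictly positive for constant features
        variances = [
            sum((v - m) ** 2 for v in col) / n + var_floor
            for col, m in zip(columns, means)
        ]
        model[label] = (math.log(n / len(X)), means, variances)
    return model


def predict(model, row):
    """Return the label with the highest log posterior for one document."""
    def log_posterior(params):
        log_prior, means, variances = params
        return log_prior + sum(
            -0.5 * math.log(2 * math.pi * var) - (x - m) ** 2 / (2 * var)
            for x, m, var in zip(row, means, variances)
        )
    return max(model, key=lambda label: log_posterior(model[label]))


# Toy stand-in for ptgd: one row per document, one column per topic (assumed layout).
doc_topic = [
    [0.80, 0.15, 0.05],
    [0.70, 0.20, 0.10],
    [0.10, 0.75, 0.15],
    [0.05, 0.85, 0.10],
]
# Hypothetical class labels, one per document.
labels = ["databases", "databases", "networks", "networks"]

model = fit_gaussian_nb(doc_topic, labels)
prediction = predict(model, [0.75, 0.15, 0.10])
print(prediction)
```

With the real model, doc_topic would be replaced by the rows of ptgd for the labelled training documents, and the held-out evaluation documents would be scored the same way.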
