Economist on "Text Analysis Question"
I've been using NLTK recently to do some text analysis. I'd like to know the following: If I have 3000 sentences, and each sentence has a value x (say, sentence_one = 0.35), is there a way to know...
View ArticleStatsBro on "Text Analysis Question"
Yes - create a document term matrix from the data - this is a matrix of dummies for word mentions in each observation (you can also do bi-grams, tri-grams, etc.) but you'll likely have to do some...
View ArticleEconomist on "Text Analysis Question"
Yes - create a document term matrix from the data - this is a matrix of dummies for word mentions in each observation (you can also do bi-grams, tri-grams, etc.) but you'll likely have to do some...
View ArticleEconomist on "Text Analysis Question"
Have a look at the classify module or alternatively combine with StatsModels
View ArticleStatsBro on "Text Analysis Question"
^^ yes - it's very context dependent but in general I would start with frequent terms - usually the way it's done is in the cleaning process you specify a sparsity level to remove sparse terms so...
View Article