Quantcast
Browsing latest articles
Browse All 16 View Live

Economist on "Text Analysis Question"

I've been using NLTK recently to do some text analysis. I'd like to know the following: If I have 3000 sentences, and each sentence has a value x (say, sentence_one = 0.35), is there a way to know...

View Article


Economist on "Text Analysis Question"

reg x log(x)

View Article


Economist on "Text Analysis Question"

bump

View Article

Economist on "Text Analysis Question"

have you tried log naics?

View Article

StatsBro on "Text Analysis Question"

Yes - create a document term matrix from the data - this is a matrix of dummies for word mentions in each observation (you can also do bi-grams, tri-grams, etc.) but you'll likely have to do some...

View Article


Economist on "Text Analysis Question"

Yes - create a document term matrix from the data - this is a matrix of dummies for word mentions in each observation (you can also do bi-grams, tri-grams, etc.) but you'll likely have to do some...

View Article

Economist on "Text Analysis Question"

Have a look at the classify module or alternatively combine with StatsModels

View Article

StatsBro on "Text Analysis Question"

^^ yes - it's very context dependent but in general I would start with frequent terms - usually the way it's done is in the cleaning process you specify a sparsity level to remove sparse terms so...

View Article

Browsing latest articles
Browse All 16 View Live