Economics Job Market Rumors Topic: Text Analysis Question

Economist on "Text Analysis Question"

March 6, 2016, 2:17 pm

I've been using NLTK recently to do some text analysis. I'd like to know the following: If I have 3000 sentences, and each sentence has a value x (say, sentence_one = 0.35), is there a way to know...

Economist on "Text Analysis Question"

March 6, 2016, 2:18 pm

reg x log(x)

Economist on "Text Analysis Question"

March 6, 2016, 4:02 pm

bump

Economist on "Text Analysis Question"

March 6, 2016, 4:08 pm

have you tried log naics?

StatsBro on "Text Analysis Question"

March 6, 2016, 4:19 pm

Yes: create a document-term matrix from the data. This is a matrix of dummies for word mentions in each observation (you can also use bigrams, trigrams, etc.), but you'll likely have to do some...
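
A minimal sketch of the document-term matrix idea, using only the Python standard library. The helper names (`tokenize`, `ngrams`, `document_term_matrix`), the regex tokenizer, and the three example sentences are assumptions made for illustration; they are not from the thread, and a real pipeline would probably use NLTK's tokenizers instead.

```python
import re


def tokenize(sentence):
    """Lowercase a sentence and split it into alphabetic word tokens."""
    return re.findall(r"[a-z']+", sentence.lower())


def ngrams(tokens, n):
    """Return the n-grams of a token list as space-joined strings."""
    return [" ".join(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]


def document_term_matrix(sentences, max_n=1):
    """Build a 0/1 document-term matrix.

    Returns (vocab, rows): vocab is a sorted list of terms, and
    rows[i][j] is 1 if term vocab[j] occurs in sentence i, else 0.
    Terms include every n-gram for n = 1 .. max_n.
    """
    docs = []
    for sentence in sentences:
        tokens = tokenize(sentence)
        terms = set()
        for n in range(1, max_n + 1):
            terms.update(ngrams(tokens, n))
        docs.append(terms)

    vocab = sorted(set().union(*docs))
    rows = [[1 if term in doc else 0 for term in vocab] for doc in docs]
    return vocab, rows


# Hypothetical example sentences (not from the original question).
sentences = ["Rates rose sharply", "Rates fell", "Growth rose"]
vocab, dtm = document_term_matrix(sentences, max_n=2)
```

Each row of `dtm` lines up with one sentence, so the per-sentence value x can sit alongside it as the outcome variable, with the dummy columns as regressors.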

Economist on "Text Analysis Question"

March 6, 2016, 4:28 pm

Have a look at NLTK's classify module, or alternatively combine NLTK with StatsModels.

StatsBro on "Text Analysis Question"

March 6, 2016, 4:29 pm

^^ Yes. It's very context-dependent, but in general I would start with frequent terms. The usual approach is to specify a sparsity level during cleaning to remove sparse terms, so...
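
A rough sketch of what removing sparse terms could look like. It assumes "sparsity" means the share of documents in which a term does not appear; the function name `remove_sparse_terms`, the 0.5 threshold, and the toy matrix are all hypothetical and chosen only for illustration.

```python
def remove_sparse_terms(vocab, rows, max_sparsity):
    """Drop terms that are missing from more than `max_sparsity` of documents.

    vocab is a list of terms; rows is a 0/1 document-term matrix with one
    row per document and one column per term in vocab.
    """
    n_docs = len(rows)
    if n_docs == 0:
        return list(vocab), []

    keep = []
    for j in range(len(vocab)):
        doc_freq = sum(row[j] for row in rows)  # documents containing term j
        sparsity = 1 - doc_freq / n_docs        # share of documents without it
        if sparsity <= max_sparsity:
            keep.append(j)

    kept_vocab = [vocab[j] for j in keep]
    kept_rows = [[row[j] for j in keep] for row in rows]
    return kept_vocab, kept_rows


# Toy 0/1 matrix: 4 documents, 4 terms (hypothetical data).
toy_vocab = ["fell", "growth", "rates", "rose"]
toy_rows = [
    [0, 0, 1, 1],
    [1, 0, 1, 0],
    [0, 1, 0, 1],
    [0, 0, 1, 1],
]
kept_vocab, kept_rows = remove_sparse_terms(toy_vocab, toy_rows, max_sparsity=0.5)
```

Here "fell" and "growth" each appear in only one of four documents (sparsity 0.75), so they are dropped, while "rates" and "rose" (sparsity 0.25) are kept. The right threshold depends on the corpus.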
