Notes on Corpus Analysis

From what I remember, we talked about more than just age range: the idea was to search for key phrases that might help determine age range and also other things about the writer, such as whether they were in the military.  Here are some potential options for doing that.

 

Scripts to accomplish phrase searching

Concordance Tools
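
To give a sense of scale for the first option, here is a rough Python sketch of a phrase-searching script. The category names and phrases in PHRASES are made up for illustration (we have not agreed on any list yet), and find_phrases is just a name I picked. It counts whole-phrase, case-insensitive matches and groups them by category.

```python
import re
from collections import defaultdict

# Hypothetical categories and phrases, for illustration only.
# A real list would have to come from looking at the corpus.
PHRASES = {
    "military": ["my unit", "basic training", "deployed"],
    "age": ["my grandkids", "when i retire", "in high school"],
}


def find_phrases(text, phrases=PHRASES):
    """Return {category: [(phrase, count), ...]} for phrases found in text."""
    hits = defaultdict(list)
    for category, phrase_list in phrases.items():
        for phrase in phrase_list:
            # \b keeps "deployed" from matching inside "redeployed".
            pattern = r"\b" + re.escape(phrase) + r"\b"
            count = len(re.findall(pattern, text, flags=re.IGNORECASE))
            if count:
                hits[category].append((phrase, count))
    return dict(hits)
```

Running this over each document would give a per-writer tally that we could then inspect by hand. It only finds exact phrases, so spelling variants would each need their own entry.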

 

A good first step might be to experiment with concordancing and see how far that gets us; if we need more sophisticated analysis, we could look at Python.  I don't have much experience with R or PHP, but either could be another option.
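
If we do end up in Python, a basic concordance (keyword in context) view is not much code. The sketch below is a minimal version under my own assumptions: the function name concordance and the width parameter are placeholders, and it works on a plain string rather than on whatever file format the corpus ends up in.

```python
import re


def concordance(text, phrase, width=30):
    """Return keyword-in-context lines for each match of phrase in text."""
    # Collapse line breaks and repeated spaces so each result stays on one line.
    text = " ".join(text.split())
    pattern = re.compile(r"\b" + re.escape(phrase) + r"\b", re.IGNORECASE)
    lines = []
    for match in pattern.finditer(text):
        left = text[max(0, match.start() - width):match.start()]
        right = text[match.end():match.end() + width]
        # Right-align the left context so the matched phrases line up in a column.
        lines.append(f"{left:>{width}} [{match.group(0)}] {right}")
    return lines
```

For example, printing each line of concordance(some_text, "army") would show every occurrence of "army" with up to 30 characters of context on each side. Dedicated concordance tools do much more (sorting, collocates, frequency lists), so this is only a fallback if those tools fall short.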