How to Index all words that are in caps in a group of documents

Status
Not open for further replies.

superhans

Beta member
Messages
1
Hi, I have a group of 20k documents. I would like to have a list of all of the words that they contain which are in capital letters, ordered by frequency. What is the simplest way of doing that?
 
Text file process starting from *.txt . Convert space to a delimination character like % because some languages can't use space as a delimited character. process text with case character equal TRUE store in text variable 1 . If variable 1 contains word modify word integer +1 store as word %integer% .

Then you will have a list of all words with capital and a number next to it contained by%% to indicate how many were found.

Should take about 2 minute to compile depending on the language.
 
Status
Not open for further replies.
Back
Top Bottom