2019-2020 Fellows:

  • Simone Durney, Ecology
  • Jenna Mattice, Biochemistry
  • Elijah Meyer, Statistics
  • Madison Nelson, Physics
  • Taylor Preul, Fish and Wildlife Management
  • George Schaible, Biochemistry
  • Sam Verplanck, Mechanical Engineering
  • Mei Ling Wong, LRES

R Script

This R script can be copied and pasted into R.  In R, the script will read an input file or multiple input files and calculate the amount of jargon present in the file(s).  It will also create a list of words that may be jargon, so the writer can consider those words when writing for a general audience.  An optional zip file contains pre-assembled corpora, as well as two example files. 

Below is a graph of the documents we have used to benchmark j values.  Also shown are two word clouds, and it can be seen that the majority of words spoken in TED talks are quite different than the majority of words in NASA E-books

 

NASA E book word cloudTED talk word cloudBenchmark jargon values

 Funded by the National Science Foundation, grant #1735124.                                               NSF Logo