the basics of nlp for accounting and finance with python

resources for researchers, like me, who have never done nlp work for food

  1. basic understanding of the concepts of natural language processing
  2. some basic introduction to python and programming
    • it doesn’t matter which introduction you use; you just need to be able to set up a development environment and function in python
    • i recommend the anaconda distribution and the conda package manager
    • when i taught this, i based my introduction on the introduction section from quant-econ; they’ve since reorganized
    • learning github is probably worth your time
  3. learn the basics of interacting with edgar, and using regular expressions in python
  4. find examples similar to what you are trying to do
    • something like gow et al. (2021) gives a good description of the development process
    • gow’s ling_features package provides excellent examples of how to conduct a number of useful tasks
  5. set up your own system for interacting with edgar
    • this depends on your hardware and internet resources
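
as a rough illustration of items 3 and 5, here is a minimal sketch of using a regular expression to pull filings out of one of edgar's quarterly index files. it assumes the pipe-delimited master.idx layout (cik, company name, form type, date filed, filename) and the full-index url pattern shown in the code, so check both against the sec's current documentation. the function names and the sample rows are hypothetical and made up for this example; they are not taken from gow et al. or any package mentioned above.

```python
import re
import urllib.request

# Assumed URL pattern for EDGAR's quarterly master index (verify before relying on it).
INDEX_URL = "https://www.sec.gov/Archives/edgar/full-index/{year}/QTR{quarter}/master.idx"

# Assumed row layout: CIK|Company Name|Form Type|Date Filed|Filename
ROW_PATTERN = re.compile(
    r"^(?P<cik>\d+)\|(?P<company>[^|]+)\|(?P<form>[^|]+)\|"
    r"(?P<date>\d{4}-\d{2}-\d{2})\|(?P<path>\S+)$"
)


def index_url(year, quarter):
    """Build the URL of one quarterly index file."""
    return INDEX_URL.format(year=year, quarter=quarter)


def fetch_index(year, quarter, user_agent):
    """Download one quarterly index; the SEC asks for a User-Agent that identifies you."""
    request = urllib.request.Request(
        index_url(year, quarter), headers={"User-Agent": user_agent}
    )
    with urllib.request.urlopen(request, timeout=30) as response:
        # latin-1 maps every byte to a character, so decoding cannot fail
        return response.read().decode("latin-1")


def parse_index(text, form_type="10-K"):
    """Return one dict per index row whose form type equals form_type."""
    rows = []
    for line in text.splitlines():
        match = ROW_PATTERN.match(line.strip())
        # header and separator lines do not start with a numeric CIK, so they are skipped
        if match and match.group("form") == form_type:
            rows.append(match.groupdict())
    return rows


# Made-up rows in the assumed layout, so the parser can be tried offline.
SAMPLE = """\
CIK|Company Name|Form Type|Date Filed|Filename
--------------------------------------------------------------------------------
1234567|EXAMPLE CORP|10-K|2021-03-01|edgar/data/1234567/0000000000-21-000001.txt
7654321|SAMPLE HOLDINGS INC|8-K|2021-03-02|edgar/data/7654321/0000000000-21-000002.txt
"""

annual_reports = parse_index(SAMPLE)
```

calling parse_index on the text returned by fetch_index(2021, 1, "your name your@email") should work the same way, but the sketch never calls fetch_index itself, so it runs without network access. if you loop over many quarters or download the filings themselves, pause between requests and stay within the sec's published fair-access limits; how much you can pull at once is where your hardware and internet resources come in.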