basic nlp resources
the basics of nlp for accounting and finance with python
resources for researchers, like me, who have never done nlp work for food
- basic understanding of the concepts of natural language processing
- recommended reading: speech and language processing, by jurafsky & martin
- chapters 1-4 at least, skip the stuff on speech recognition
- some basic introduction to python and programming
- doesn’t matter what it is you just need to be able to set up a development environment and function in python
- i recommend the anaconda distribution, and conda package manager
- when i taught this i based my introduction on the indroduction section from quant-econ they’ve since reorganized
- github is probably worth your time
- learn the basics of interacting with edgar, and using regular expressions in python
- anand et al. (2020) is a reasonable overview
- Regex 101 is an excellent resource for testing and reference
- sometimes visualizing a complex expression is helpful
- find examples similar to what you are trying to do
- something like gow et al. (2021) gives a good description of the development process
- gow’s
ling_features
package provides excellent examples of how to conduct a number of useful tasks
- set up your own system for interacting with edgar
- this depends on your hardware and internet resources