Investigating the Internet using Lisp
Preamble
This article was originally written by E. Costa and sent to the members of the Della Vos group. The text below is a translation from the original in Portuguese, with some minor editorial additions.
Investigating the Internet using Lisp
There are two ways to investigate Internet content with Lisp. The first is to use an HTTP client library, for example:
https://edicl.github.io/drakma/
https://github.com/fukamachi/dexador
https://github.com/orthecreedence/carrier
https://github.com/fukamachi/fast-http
https://github.com/svenvc/s-http-client
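The last four are better choices than Drakma because of their smaller footprint. Note that fast-http is an HTTP request/response parser rather than a complete client; Dexador, for example, builds on it.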
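As an illustration of the first approach, here is a minimal sketch that fetches a page with Dexador. It assumes Quicklisp is already installed and loaded in the running Lisp image; FETCH-PAGE is a hypothetical helper name chosen for this example, not part of Dexador.

```lisp
;; Load Dexador through Quicklisp (assumption: Quicklisp is available).
;; EVAL-WHEN makes the DEX package exist before the forms below are read.
(eval-when (:compile-toplevel :load-toplevel :execute)
  (ql:quickload :dexador))

(defun fetch-page (url)
  "Fetch URL and return the response body and the HTTP status code.
On a 4xx/5xx response, return NIL and the status code instead of
letting the error propagate."
  (handler-case
      ;; DEX:GET returns the body, status code, headers, URI and stream
      ;; as multiple values; only the first two are kept here.
      (multiple-value-bind (body status) (dex:get url)
        (values body status))
    ;; Dexador signals this condition for 4xx and 5xx responses.
    (dex:http-request-failed (e)
      (values nil (dex:response-status e)))))
```

A call such as (fetch-page "https://edicl.github.io/drakma/") returns the page body as a string together with the status code. With Drakma, the corresponding call would be (drakma:http-request url), which also returns the body and the status code as its first two values.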
For best results, you should also use a natural language processing system, such as the tagger:
https://github.com/g000001/tagger
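The tagger has its own interface, which is not reproduced here. As a much simpler stand-in, plain word counting already gives a rough idea of what a fetched text is about. The sketch below uses only standard Common Lisp; TOKENIZE and WORD-FREQUENCIES are hypothetical helper names for this example and are unrelated to the tagger's API.

```lisp
(defun tokenize (text)
  "Split TEXT into lowercase words, treating every non-alphabetic
character as a separator."
  (let ((words '())
        (current (make-string-output-stream)))
    (flet ((flush ()
             ;; GET-OUTPUT-STREAM-STRING returns the accumulated
             ;; characters and resets the stream for the next word.
             (let ((word (get-output-stream-string current)))
               (when (plusp (length word))
                 (push word words)))))
      (loop for ch across text
            do (if (alpha-char-p ch)
                   (write-char (char-downcase ch) current)
                   (flush)))
      (flush))
    (nreverse words)))

(defun word-frequencies (text)
  "Return an alist of (word . count) pairs for TEXT, most frequent first."
  (let ((table (make-hash-table :test #'equal)))
    (dolist (word (tokenize text))
      (incf (gethash word table 0)))
    (let ((pairs '()))
      (maphash (lambda (word count) (push (cons word count) pairs))
               table)
      (sort pairs #'> :key #'cdr))))

;; Example:
;; (tokenize "Lisp, the Internet & Lisp!")
;; → ("lisp" "the" "internet" "lisp")
```

Real pages would first need their HTML markup removed, and a part-of-speech tagger does far more than count words; this only shows the kind of preprocessing that usually comes before such a tool.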
If you use the tagger or any other Common Lisp system, install it with Quicklisp. A good example of how to present results and build reports from Internet research is this COVID-19 data exploration system:
https://github.com/dnaeon/cl-covid19
Everything I mentioned above is suited to amateur systems. If you really want powerful systems, you will need to contact Cycorp:
Cycorp offers powerful natural language resources and is used in large projects, such as the Cleveland Clinic’s natural language interface for biomedical information: