-
NLP (1/n): Scraping all the Blog articles (the hard way)
Intro To use Natural Language Processing algorithms, we first need data. We’ve seen last time how to scrape ONE article. And how to get to different pages of the Blog. But for this to be usable in the future, we should be able to download all articles UNTIL there is no more (e.g. we probably…
-
Going SQL: Postgres (2/3)
Last week we got a Container up and running with a Postgres SQL Server in it. We’re going to need a Database client to check it. At this point, in theory, we could use some R package (more on that later) to interact with our DBMS container, but instead we’ll go for a Visual client…
-
A revelation: R Futures
Intro I have been accepting (NOT complaining about) R limitation of being mono-core, monothread. For a long time. Actually, for years. I even went all the way to using Docker to distribute compute load across CPUs of one (or more) physical machine. It was just something I had “assumed” was like that. But what little…