Category: NLP

  • A quick look at CISA KEV

    Intro I keep hearing about it, so it was time I had a look at that famous CISA “Known Exploited Vulnerability” dataset. Also, it’s been a while since I did something more directly related to IT Security, so this is good. It turns out, it’s quite… Simple (the dataset), and even clean, which makes things…

  • NLP (3/n): Parts of Speech Tagging

    Intro So I’ve been working on other things lately, but I wanted to keep improving/practicing a bit with the NLP concepts. As I mentioned already, I spent a bit of my (little) spare time to review concepts. One book I am finding I rely heavily upon is “Mastering Text Mining with R”, from the Packt…

  • NLP (2/n): Tokens of text

    A first (basic) look at the collected text: Tokens It’s all great: We’re able to collect all our blog entries, or well, the text thereof. We are even able to crawl a bit faster, provided we have a multi-core setup. (Worst case scenario, we’re limited in the crawling by the speed of our website’s, mostly).…