Category: Blog

  • On Sampling

    In a past exercise (the one on a simplified visualization of Netflows), I had a very big file and wanted to extract a subset of the data to demo a visualization. The first part of the entry covered a way to divide a big file into smaller chunks and extract samples with a…

  • Reading CSVs faster in R

    Intro: They say that most of the time in Data Science is spent preparing the data for analysis. This has definitely been my experience so far. One Wednesday night, not so long ago, I was working on a report, or rather a dashboard (yes, Shiny).…

  • A (very) simple “Network Traffic Visualization” using Network Graphs

    A network visualization has nodes and edges. I like to apply it to actual IT network traffic, where connections (the “edges”) link two IP addresses or machines (the “nodes”). Let’s try to apply that to some Netflow data… Getting the data: After some searching (that was some time ago now), I came across secrepo.com. There…