Divide & Recombine for Big Data Analysis for Cybersecurity - Application of DNS Blacklist Query Study
Project Members
Dr. William S. Cleveland, John Gerth
Dr. William S. Cleveland, John Gerth
Abstract
D\&R is a statistical approach to big data that provides comprehensive,
detailed analysis. This is achieved
because almost any analytic method from machine learning, statistics, and
visualization can be applied to the data at their finest level of granularity.
D\&R also enables feasible, practical computation because the computations are
largely embarrassingly parallel. Our work has two core threads.
1. Tailor the D\&R environment to analyse big data in cybersecurity.
2 Apply this tailored environment the Spamhaus traffic at the Stanford
University mirror.