%0 Conference Paper %B International Conference on Big Data, IEEE BigData %D 2014 %T Large-Scale Network Traffic Monitoring with DBStream, a System for Rolling Big Data Analysis %A Arian Bär %A Alessandro Finamore %A Pedro Casas %A Lukasz Golab %A Marco Mellia %K Big Data Analysis %K Data Stream Processing %K network data analysis %K System Performance %X

The complexity of the Internet has rapidly increased, making it more important and challenging to design scalable network monitoring tools. Network monitoring typically requires rolling data analysis, i.e., continuously and incrementally updating (rolling-over) various reports and statistics over high-volume data streams. In this paper, we describe DBStream, which is an SQL-based system that explicitly supports incremental queries for rolling data analysis. We also present a performance comparison of DBStream with a parallel data processing engine (Spark), showing that, in some scenarios, a single DBStream node can outperform a cluster of ten Spark nodes on rolling network monitoring workloads. Although our performance evaluation is based on network monitoring data, our results can be generalized to other big data problems with high volume and velocity.

%B International Conference on Big Data, IEEE BigData %I IEEE %C Washington D.C., USA %8 11/2014 %G eng