Why We Developed DBStream

Tue, 05/19/2015 - 17:10 — Arian Bär (Fors...

TL;DR

Systems for processing network monitoring data need to offer high performance and usability. We think the Open Source system DBStream offers both.

/TL;DR

When I started working at FTW, my main task was to speed up data processing. This was not an easy task, especially since nothing should be sacrificed.
All data should be stored, access should be instantaneous, answers should be exact and the implementation and deployment of typical processing tasks should be easy.
So we looked at multiple tools which where commonly used to solve those challenges. First, we looked at Hadoop, which back then was in a rather early stage, providing not that great performance and for using it people would need to implement Java programs (which was a problem since our team is coming from a C background).
Then we looked at complex index structures like bitmap indices which make file access faster. Here, the main problem was that we would still need to use and handle files, which has all the drawbacks like complex directory structures, concurrent access and many more. In addition, the tools we tried had some problems and did not seem to be production ready. The last thing we look at where database systems. They offered the high level, declarative query language SQL, indexing functionality and are made for storing and handling data.
But, they have the following problems:

Importing data is slow (especially single row INSERTs).
As tables get bigger, performance suffers (especially index structures become slow to maintain).
Parallelization is not supported out of the box.

Therefore, we started looking into ways how to solve those performance issues. We were able to solve the import problem by using a dedicated COPY command offered by PostgreSQL. But, the second and third problem still remained. Therefore, we investigated the structure of our data more closely, in order to find a solution for our problem.

DBStream in the network

Network monitoring data has properties typically found in sensor data. All recorded data is about events which happened in the past.
Those events do not change after they have been monitored. In addition, only new events are added to the end of the stream. Thus, the stream continues to grow and can get very big if recorded over extended time periods.

After evaluating several different approaches, we found that partitioning the data based in the time stamp solves all the mentioned problems. First, for each, e.g., 10 minutes, we create a new table partition. Such a partition does not get that big anymore and therefore indices can be created once and do not have to be updated and deleting old data can be done by just deleting the oldest partitions. In addition, queries can now be executed on multiple partition in parallel, which can be used to speed up processing. Therefore, we decided to use a regular Postgresql database as the data processing and storage engine for DBStream.

What can I do with DBStream?

In the first version of the system, data were imported automatically from a monitoring probe into the database. On the imported data, people could run simple queries, apply deep statistics, export the results for plotting and many other tasks. The problem was that whenever people wanted to access data they had to go back to the imported data, which in fact was very big (e.g. up to 1 TB a day, for some streams). Obviously, an aggregation of several days or weeks of data of this size either takes some time or costs a lot of resources (buying and maintaining more disks). In addition, it is very expensive to store extended amounts of time of data of this size. On the other hand many of the performed analysis actually needed only a small part of the imported data and typically at higher granularities (if you want to plot two weeks of data it does not make sense to use a per (milli-)second aggregation).

To solve this problem, we started to filter and pre-aggregate the data and only store the aggregation results over extended time periods. But if you want to apply this approach manually it is a very cumbersome task, where, e.g., every day someone has to run the aggregation. The next automation step is use Cron jobs to start the aggregation automatically for you. So that is what we implemented and for some tasks it worked fine.

But after some time and multiple implemented aggregations, we realized that this approach is not optimal. Since data is imported in batches, the aggregation can only start after a full batch as finished. Therefore, if we import data in 1 hour batches when should we start the daily aggregation?
If we start it at 0:00 the last hour of data of the previous day, from 23:00:00 to 23:59:59 will not have finished importing since it could only start importing at 23:59:59. Therefore, we somehow need to guess how long it takes to process this hour and then start the aggregation. Now, if there is, e.g., an anomaly causing more data than normal to be produced, the import will take longer and we should start the aggregation later. That means, before starting the aggregation we should actually check if the data is really available and only then start the aggregation.

For just a couple of aggregations this is doable, but problems get more serious when you want to run many aggregations and especially if you want to run aggregations on top of other aggregations. In addition, running the aggregation of a whole day consumes quite some resources, especially if multiple such aggregations are running at the same time. Also monitoring and operating those aggregations gets a mess since there is no single point to get any information about which aggregations are running, which had a problem or failed and what was the reason for failing. You have to look in the log of the Cron job, the PostgreSQL log and in your case in the log output of the stored procedures we wrote to handle processing the aggregations. If a aggregation needs data from two import streams the synchronization becomes really cumbersome.

Those problems led us to design and implement the DBStream system. In DBStream, in addition to the monitoring results, each row has a time stamp.
This time stamp is then used by DBStream to not only partition the data, but also to automatically start any aggregations registered on top of the imported data. Of course, the output of a aggregation can be used as the input to another aggregation and each aggregation can have multiple input streams.

In DBStream the time windows can be set much smaller than a full day. Therefore, imported data are aggregated as they arrive at the system and not only once in the night.

A typical DBStream aggregation job looks like this:

<job inputs="A (window 60min)" output="B" schema="serial_time int4, total_download int8, total_upload int8">
<query>
select _STARTTS, sum(download), sum(upload) from A group by _STARTTS
</query>
</job>

The DBStream SCHEDULER module automatically starts aggregations as soon as enough data has arrived in all input windows of a job. The query specified by the user can use the full SQL syntax offered by PostgreSQL, including also user defined functions.

DBStream performance comparison with Spark.

Performance comparison of DBStream vs. Spark, for details about the executed workload and used datasets please refer to [2].

If this article made you interested in DBStream we recommend you to have a look at the Github page: https://github.com/arbaer/dbstream/ and would like to point you to our most recent publications:
In [1], we give an overview of the different applications we implemented using DBStream and in [2] we presents a performance comparison of DBStream with Apache Spark.

Arian Baer (arian.baer _at_ gmail.com)

[1] Arian Baer, Pedro Casas, Lukasz Golab and Alessandro Finamore "DBStream: An Online Aggregation, Filtering and Processing System for Network Traffic Monitoring", Wireless Communications and Mobile Computing Conference (IWCMC), 2014.

[2] Arian Baer, Alessandro Finamore, Pedro Casas, Lukasz Golab, Marco Mellia "Large-Scale Network Traffic Monitoring with DBStream, a System for Rolling Big Data Analysis", IEEE International Conference on Big Data (IEEE BigData), 2014.

Arian Bär (Forschungszentrum Telekommunikation Wien Gmbh)'s blog

Latest News

"Characterizing IPv4 Anycast Adoption and Deployment" awarded the IRTF Applied Networking Research Prize

We are proud to announce that the IRTF awarded the Applied Networking Research Prize 2016 (ANRP 2016) to the paper Cicalese, D., J. Auge, D. Joumblatt, T. ur Friedman, and D. Rossi, "Characterizing IPv4 Anycast Adoption and Deployment", ACM CoNEXT, Heidelberg, ACM, 12/2015.Congratulation to Dario's team, and to mPlane for supporting this research!! And thanks to the IRTF and ISOC for...

No QUIC anymore?

UPDATE: After 6 days, still no QUIC traffic... was it so bad? VP1 VP2VP3Saturday 5/12/2015 - It seems Google just stopped serving QUIC on all its servers. Bug or what? :)

mPlane talk @ IRTF RAIM meeting are available online

Notes from the 2015 IRTF & ISOC Workshop on Research and Applications of Internet Measurements (RAIM) in cooperation with ACM SIGCOMM are available online at http://tid.isoc.org:9001/p/raim-2015.Check talks from B. Trammell about mPlane architecture [video], P. Casas talking about results in 3G/4G networks [video].

mPlane Workshop registration is now open!

Registration for the mPlane workshop is now open!Come to see all the great work done in mPlane, to meet people, and to enjoy lively discussion with prominent researchers in the network measurement field!Check all information and how to register in the Workshop webpage.

Special issue on Machine learning, data mining and Big Data frameworks for network monitoring and troubleshooting

mPlane organizes an Elsevier Communication Networks special issue on "Machine learning, data mining and Big Data frameworks for network monitoring and troubleshooting". Please find the call for paper here. Call for papersThe complexity of the Internet has dramatically increased in the last few years, making it more important and challenging to design scalable network traffic monitoring...

EuCNC 2015 mPlane booth - Paris

mPlane will be present at the European Conference on Networks and Communications (EuCNC 2015)!mPlane will be present as exhibitors at the European Conference on Networks and Communications (EuCNC 2015), in Paris. Come to see us! You'll have the chance to see Demonstrations, Flyers, talk to our experts, and get in touch with the mPlane community!Enjoy the demos:Demonstration of the...

mPlane, RITE paper on ECN cited in Apple announcement

Apple has announced at WWDC 2015 (announcement around 34:30 here) that it is turning on ECN by default for client applications in the current developer builds of the next versions of Mac OS X and iOS. In doing so, it cited "Enabling Internet-Wide Deployment of Explicit Congestion Notification", a PAM 2015 paper that was joint work between the FP7 mPlane and RITE projects, on the current state of...

mPlane is technical sponsor of TRAC 2015

mPlane is technical sponsor of the 6th International Workshop on TRaffic Analysis and Characterization, TRAC 2015, which takes place in Dubrovnik, Croatia, from August 24-27 2015. The workshop is technically co-sponsored by IEEE.

mPlane is technical sponsor of TMA 2013

mPlane is technical sponsor of the 5th IEEE International Traffic Monitoring and Analysis Workshop, TMA 2013, which takes place in Turin, Italy, from April 14-19 2013,co-located with IEEE INFOCOM.

mPlane demonstration booth at EuCNC, June 29-July 2, 2015

mPlane has a booth at the European Conference on Networks and Communications (EuCNC) event, held in Paris between June 29 and July 2, 2015.Come and check out our demonstrations there!

mPlane final workshop co-located with ACM CoNEXT, Heidelberg November 30, 2015

NEC will host the final mPlane workshop event in Heidelberg on November 30, 2015.The event is co-localted with ACM CoNEXT.Please stay tuned for additional details.

IEEE JSAC Special issue on Measuring and Troubleshooting the Internet

mPlane organizes a JSAC special issue on "Measuring and Troubleshooting the Internet: Algorithms, Tools and Applications"Please find the call for paper here.Call for papersThe ubiquity of Internet access, and the wide variety of Internet-enabled devices and applications, have made the Internet a principal pillar of the Information Society. However, its distributed nature leads to operational...

mPlane industrial workshop in Barcelona, April 22 2015

mPlane Industrial WorkshopBarcellona - 22 April 2015 The research project mPlane (http://www.ict-mplane.eu), sponsored by the European Commission with the goal of measuring and troubleshooting Internet performance and availability by building an Intelligent Measurement Plane for Future Network and Application Management, organizes an industrial workshop to showcase the technology...

Tracebox mentioned on RIPE entry

Tracebox and mPlane did their first appearance in the Ripe website! Congratulations to the ULG team!

NEC develops new high speed solution for Internet performance monitoring

NEC Laboratories Europe is addressing the challenges presented by today’s distributed and diverse online environment by developing new monitoring and root cause analysis solutions in the research project mPlane.Full press available here.

Factsheet at the end of Second Year

mPlane reached the end of Second Year! Here is a summary of the achievements so far

Marry Christmas and Wonderful 2015

Best wishes for a Merry Christmas and a wonderful 2015 from all the mPlaners!

mPlane invited to participate to IFIP TC6 2014/2 Strategic Review Meeting in Dagstuhl

International Federation for Information Processing (IFIP) is an umbrella organization for national societies working in the field of information technology. The meeting brought together members of the Technical Committee (TC6) representing experts in the field of computer communication and networking, and a group of research leaders to provide expert input into the strategic direction for the...

The Cost of the “S” in HTTPS

The use of HTTPS is increasing and may become the default in HTTP 2.0. The privacy and security benefits of ubiquitous encryption are relatively clear, but what are the costs?Check the paper, the poster, and the presentation to see the answer!

mPlane poster

Approaching the end of the second year, mPlane is now going into dissemination and demonstration!We prepared a poster that summarizes the project aims and status. Thanks go to TI!

4th PhD School on Traffic Monitoring and Analysis (TMA)

The 4th Traffic Monitoring and Analysis (TMA) PhD School was successfully held in London, UK , April 14-16th 2014, with about 40 participants. The school was operated in cooperation with ACM SIGCOMM that kindly sponsored the event, and was for the first time held in conjunction with the 6th International Workshop on Traffic Monitoring and Analysis , increasing the interaction of PhDs with...

mPlane invited to participate in the FIRE-GENI workshop

mPlane has been invited to partcipate to the Second GENI/FIRE Collaboration Workshop - May 5-6, 2014 Cambridge MA. "mPlane – an Intelligent Measurement Plane for Future Network and Application Management"The focus is on Instrumentation and Measurement - interoperability among monitoring testbed - a clear example where the mPlane architecture can be a winner.

Brian Trammell appointed as IAB member!

On 13 February 2014, the NomCom announced the selection of the IAB slate whose terms will start at IETF 89 in March 2014: Mary Barnes Marc Blanchet (incumbent) Ted Hardie Joe Hildebrand Eliot Lear (incumbent, 1 year term) Brian TrammellCongratulation to Brian!!!

4th PhD School on Traffic Monitoring and Analysis

The 4th PhD School on Traffic Monitoring and Analysis (TMA) will be held in London, right after the TMA workshop.Deadline to register is March 12th 2014. Registration gives free access to the school, and to the TMA workshop!

Dagsthul Seminar on "Global Measurement Framework" is over

It was a very interesting opportunity to share ideas and have some discussions in a very friedly environment. Wine was not so bad either .Now it's time to tag mPlanner from the official photo

mPlane is technical sponsor of TRAC 2014

mPlane is technical sponsor of the 5th International Workshop on TRaffic Analysis and Characterization, TRAC 2014, which takes place in Nicosia, Cyprus, from August 4-8 2014. The workshop is technically co-sponsored by IEEE, and is chaired by Pedro Casas and Brian Trammell.

mPlane paper "IP Mining: Extracting Knowledge from the Dynamics of the Internet Addressing Space" got the Best Paper Award at ITC25

The paper Pedro Casas, Pierdomenico Fiadino, and Arian Bär from FTW received the Best Paper Award for the paper "IP Mining: Extracting Knowledge from the Dynamics of the Internet Addressing Space", presented at the 25th International Teletraffic Congress, ITC25, 2013, got the BEST PAPER AWARD!Congratulation to Pedro, Pierdomenico, Arian and all the FTW people!

mPlane paper appears on Slashdot

The IMC paper "Benchmarking Personal Cloud Storage" has been mentioned on Slashdot!The work has been funded within mPlane. Congratulations to Idilio and Enrico!

mPlane@IMC13

Great experience this year in Barcelona for IMC13!mPlane is Gold sponsor, and four mPlane papers presented there!Drago, I., E. Bocchi, M. Mellia, H. Slatman, and A. Pras, "Benchmarking Personal Cloud Storage", Internet Measurement Conference - IMC, Barcelona (ES), ACM, 10/2013. Vanaubel, Y., J-J. Pansiot, P. Mérindol, and B. Donnet, "Network...

TMA workshop website and CFP is available

The web site and Call for Paper of 6th Workshop on Traffic Monitoring and Analysis is up! Given the topic, this is a relevant workshop for mPlanners.Deadline for submission: November 15!

mPlane mentioned at the AGCOM workshop in Rome

The mPlane project has been mentioned during the Italian Regulatory Agency AGCOMM Workshop that has been held in Roma. The workshop focus has been on the "Qualità dell’accesso ad Internet da rete fissa in Italia" [Quality of Internet Access lines in Italy].See here for more information (in Italian)

Deliverable D3.1 completed

The Deliverable D3.1 - Basic Network Data Analysis has been completed and made available to the public. It describes the requirements, input, output for the algorithms needed to perform analytic tasks on a large amount of data, in the context of WP3. Starting from the use cases defined in WP1, we identify the algorithms needed to address the various scenario requirements.

External Advisory Board complete!

The mPlane External Advisory Board is now complete!Welcome to Mark, Fabian, Alberto and Lukasz and many thanks for their time and support to the mPlane project!

Dagsthul Seminar on "Global Measurement Framework"

mPlane contributes to the organisation of the Dagsthul Seminar on "Global Measurement Framework".OrganizersPhilip Eardley (BT Research, GB)Marco Mellia (Politecnico di Torino, IT)Jörg Ott (Aalto University, FI)Jürgen Schönwälder (Jacobs University – Bremen, DE)Henning Schulzrinne (Columbia University, US)The Dagsthul will take place from Sunday, November 17 to Wednesday,...

2nd plenary meeting 27,28,29 May in Paris!

The 2nd mPlane plenary meeting will be held in Paris, kindly hosted by ENST. Book your agenda!- sloppy start monday 27 may (11h30 -18h00)- full day tuesday 28 may (9h-18h00)- sloppy end wed 29 may (9h-16h00)

"Inside Dropbox: Understanding Personal Cloud Storage Services" awarded the IRTF Applied Networking Research Prize

We are proud to announce that the IRTF awarded the Applied Networking Research Prize 2013 (ANRP 2013) to the paperDrago, I., M. Mellia, M. Munafo', A. Sperotto, R. Sadre, and A. Pras, "Inside Dropbox: Understanding Personal Cloud Storage Services", Internet Measurement Conference - IMC, Boston, MA, ACM, 11/2012Congratulation to Idilio, and to mPlane for...

Happy 2013 from mPlane

Check below some statistics on how people celebrated the new year's eve @ midnight!It seems people stopped using the WEB after all, and started drinking some good glasses of spumante in the real world!And let's call my friend to whish him a very nice 2013!! Check the number of VoIP calls:Same anomaly from a larger time scale:Happy new 2013 to all mPlanners!

Check the pictures of the mPlane kick off meeting in Torino

A selection of pictures during the mPlane kickoff meeting are available. Feel free to find yourself

mPlane officially supports the 5th IEEE International Traffic Monitoring and Analysis Workshop (TMA 2013)

mPlane is proud to anounce the officiail support to the 5th IEEE International Traffic Monitoring and Analysis Workshop (TMA 2013) that will be help in Torino, Italy, April 19, 2013. We look forward to see interesting paper on traffic analysis, and some exicting news from mPlane.

Presentation of mPlane at the Cloud-based Service Platforms for the Future Internet workshop, ICCLab

Presentation of mPlane at the Cloud-based Service Platforms for the Future Internet workshop, ICCLab, Zürich University of Applied Sciences, Winterthur, Switzerland [PDF]

mPlane on the national newspaper "La Stampa"

mPlane made it to the "La stampa" national newspaper. You can try google translator if you really wish.

Previous Pause Next

Intranet

News RSS

Main menu

Public

You are here

Why We Developed DBStream

TL;DR

/TL;DR

What can I do with DBStream?

Latest News

Main menu

Public

You are here

Why We Developed DBStream

TL;DR

/TL;DR

What can I do with DBStream?

Latest News

Search form