It All Starts With Data Ingestion

Learn More & Download Free Hadoop Ingestion App Today

Sample Ingestion Sources: Kafka, HDFS, AWS S3n, NFS, (s)FTP, JMS


DataTorrent RTS For Enterprise

The industry’s only open source

enterprise-grade unified stream and batch platform

Open Source Enterprise-Grade Batch Processing Platform
Open Source Enterprise-Grade Platform
Enterprise Security Integration - DataTorrent Real-time Streaming (RTS)
Enterprise Security Integration
Big Data Integration & Data Streaming Analytics
Data Integration & Analytics
dtAssemble Graphical Application Assembly
dtAssemble - Gui-based Application Assembly
Real-Time Streaming and Data Visualization
dtDashboard - Self-service Real-Time Data Visualization
Data ingestion and Distribution for Hadoop - DataTorrent
dtIngest - Data ingestion and distribution for Hadoop

Connect with our Partners



Hear from DataTorrent Customers

Pubmatic - DataTorrent Partner “DataTorrent RTS, which runs on Amazon EMR,is powering  PubMatic’s real-time Ad analytics platform enabling publishers to drive the highest value for their digital media assets. It also enables advertisers to provide consumers with a more personalized advertising experience across display, mobile and video.”   – Sudhir Kulkarni | VP of Data & Analytics

SilverSpring Networks - DataTorrent Partner “At Silver Spring we deploy and operate some of the largest, most data-intensive networks on earth, connecting more than 20 million Internet-of-Things devices on five continents.  DataTorrent RTS is an integral component of our SilverLink(tm) Sensor Network solution and together we look forward to inspiring a legion of new developers to create even more powerful big data applications.“   – Jeremy Johnson | Director of Product Management

Latest Blog Posts

Stay connected with what’s going on in the DataTorrent world with the most recent blog posts.

The Next Generation of Big Data and Apache Apex

We live in an era where computational resources are being rapidly commoditized. Cloud is pervasive and is democratizing IT. Big Data led by Apache Hadoop is the lead edge of this revolution from an IT perspective. Just like automobiles did to travel in last century, mobile is doing to communication, and web is doing to information access in the past…Read more »

Latency Calculation in Apache Apex

In stream processing applications, data arrives continuously and needs to be processed expediently in order to keep up with the incoming flow. Latency is the primary metric with which the health of a streaming application is measured. High latency is typically an indication of problems. It can cause the application to be unable to keep up with the flow of incoming…Read more »