It All Starts With Data Ingestion


Sample Ingestion Sources: Kafka, HDFS, AWS S3n, NFS, (s)FTP, JMS


DataTorrent RTS For Enterprise

The industry’s only open source, enterprise-grade unified stream and batch platform

Open Source Enterprise-Grade Platform
Enterprise Security Integration
Data Integration & Analytics

dtAssemble - GUI-based Application Assembly
dtDashboard - Self-service Real-Time Data Visualization
Data Ingestion and Distribution for Hadoop




Hear from DataTorrent Customers

“DataTorrent RTS, which runs on Amazon EMR, is powering PubMatic’s real-time ad analytics platform, enabling publishers to drive the highest value for their digital media assets. It also enables advertisers to provide consumers with a more personalized advertising experience across display, mobile and video.” – Sudhir Kulkarni | VP of Data & Analytics, PubMatic

“At Silver Spring we deploy and operate some of the largest, most data-intensive networks on earth, connecting more than 20 million Internet-of-Things devices on five continents. DataTorrent RTS is an integral component of our SilverLink™ Sensor Network solution, and together we look forward to inspiring a legion of new developers to create even more powerful big data applications.” – Jeremy Johnson | Director of Product Management, Silver Spring Networks

“The ability to ingest, transform, and analyze large volumes of data-in-motion in real time, coupled with rapid time to production, are two of DataTorrent’s key differentiators. Its open source underpinnings (namely Apache Apex, which recently graduated to a Top Level Project, but also Apache Hadoop YARN) should also give potential prospects the reassurance that they are joining a vibrant open-source ecosystem built around in-memory data processing. Crucially, DataTorrent has also focused hard on ease of use, for example offering self-service visualization to help quickly deliver insights, as well as GUI-based application development for less technical users.”

— Jason Stamper, Analyst, 451 Research

Latest Blog Posts

Stay connected with what’s going on in the DataTorrent world with the most recent blog posts.

Fault-Tolerant File Processing

A majority of big data setups still use files; streaming applications and platforms are a new concept. For a streaming platform to be widely usable, it should be able to handle files and process them in a performant way. Apache Apex provides out-of-the-box support for handling files. This comes in the form of various operators…

The Next Generation of Big Data and Apache Apex

We live in an era where computational resources are being rapidly commoditized. The cloud is pervasive and is democratizing IT. Big Data, led by Apache Hadoop, is the leading edge of this revolution from an IT perspective. Just as automobiles did to travel in the last century, mobile is doing to communication, and the web is doing to information access in the past…