Distributed Batch Data Processing

What is Apache Spark? The big data platform that crushed Hadoop

At the heart of Apache Spark is the concept of the Resilient Distributed Dataset (RDD), a programming abstraction that represents an immutable collection of objects that can be split across a ...

The Next Platform

Flink Sparks Next Wave of Distributed Data Processing

If you haven’t heard of Flink until now, get ready for the deluge. As one of a stream of Apache incubator-to-top-level projects turned commercial effort, the data processing engine’s promise is to ...

InfoWorld

Why data contracts need Apache Kafka and Apache Flink

Data contracts are foundational to properly designed and well-behaved data pipelines. Kafka and Flink provide the key ...

InfoQ

AWS Introduces Step Functions Distributed Map for Large-Scale Parallel Data Processing

InfoQ

Exploring the Fundamentals of Stream Processing with the Dataflow Model and Apache Beam

Sarasota Magazine

The State of Data Engineering Trends: Why Your Data Infrastructure is Undergoing a Git-Style Revolution

The data engineering trends clearly show a move toward maturity. The emphasis is on building reliable, repeatable, and ...

SiliconANGLE

Hortonworks’ YARN Aims to Revolutionize Hadoop Data Processing

Hortonworks is publishing a series of blog posts on its website that explain the basics and finer details of Apache Hadoop YARN. Those who are curious about YARN or want to understand its significance ...

adtmag.com

Managing Batch Processing in an SOA

Most enterprise IT operations rely heavily on batch processing operations. The reliance doesn't go away when you move to a service-oriented architecture (SOA), yet SOA just means online transaction ...

Forbes

What The Pony Express Teaches Us About Crossing The Fast Data Divide

During its 18 months of operation in the early 1860s, the Pony Express was a shining example of the inextricable link between data delivery and data processing. While an innovation for its time, the ...

Journal of Accountancy

ABCs of Batch Processing

In batch processing, if costs are not isolated, high-volume customers and products tend to subsidize lower-volume ones. This article reviews different types of batch activities and how they would be ...
