Apache Drill is an open-source distributed query engine that enables one to aggregate data from various sources (including raw files) using the industry-standard ANSI SQL query language.
Here is the link to the article on Apache Drill:
Enjoy !!!
Apache Parquet is an efficient open-source data storage format that works well with various data processing frameworks.
Here is the link to the article on Apache Parquet:
Introduction to Apache Parquet
Enjoy !!!
In Part-7 (FINAL) of this series, we will explore the different Stream Joining techniques in Kafka Streams. We will implement and demonstrate the concepts using a simple application in Java. Here is the link to the article:
Exploring Kafka Streams :: Part 7
Enjoy !!!
In Part-6 of this series, we will re-explore the different Windowing techniques of Kafka Streams for aggregation, but with two changes: the first is to extract the Kafka message timestamp and the second is to use an in-memory state store. We will implement and demonstrate the concepts using a simple application in Java. Here is the link to the article:
Exploring Kafka Streams :: Part 6
Enjoy !!!
In Part-5 of this series, we will explore the different Windowing techniques of Kafka Streams for aggregation. We will implement and demonstrate the concepts using a simple application in Java. Here is the link to the article:
Exploring Kafka Streams :: Part 5
Enjoy !!!
In Part-4 of this series, we will shift gears to explore other aspects of Kafka Streams such as Custom data Serialization and Deserialization. We will implement and demonstrate the concepts using a simple application in Java. Here is the link to the article:
Exploring Kafka Streams :: Part 4
Enjoy !!!
In Part-3 of this series, we will continue to dig a little deeper into Kafka Streams to understand State Store and Record Cache. We will implement and demonstrate the concepts using a simple application in Java. Here is the link to the article:
Exploring Kafka Streams :: Part 3
Enjoy !!!
In Part-2 of this series, we will start to dig a little deeper into Kafka Streams to understand Stream Tasks and Stream Threads. We will implement and demonstrate the concepts using a simple application in Java. Here is the link to the article:
Exploring Kafka Streams :: Part 2
Enjoy !!!
In Part-1 of this series, we will install and set up the environment to explore Kafka Streams. We will also implement and demonstrate two simple applications using Java. Here is the link to the article:
Exploring Kafka Streams :: Part 1
Enjoy !!!
Time for a *REFRESH*: Part-2 on Apache Spark (based on version 1.x) was published way back in 10/2015.
Here is the link to Part-2 of the article on Apache Spark 2.x:
Apache Spark 2.x Quick Notes :: Part - 2
Enjoy !!!