Complete gig history

All talks by Olena Kutsenko

4 talks · 4 with video · 3 years

2025 (1)

The art of structuring real-time data streams into actionable insights
at Devoxx · May 2025 · Antwerp, Belgium 🇧🇪
Real-time data can be messy, unpredictable, and hard to manage. To unlock its full potential, you need a way to turn raw streams into clean, structured data. In this talk, we’ll show you how to use Apache Kafka, Apache Flink, and Apache Iceberg to organize real-time data streams efficiently and prepare them for advanced use cases, including AI applications. We’ll start by explaining how Kafka handles high-speed data streams and how Flink processes these streams in real time. You’ll learn how to use Flink to transform raw data into structured formats, ensuring it’s ready for storage and analysis. Then, we’ll dive into Iceberg, demonstrating how it stores and organizes structured data for easy querying, versioning, and integration with machine learning pipelines. Through clear examples, we’ll walk you through building a practical pipeline that turns chaotic data streams into organized schemas. By the end of the session, you’ll know how to manage real-time data effectively and set the stage for downstream AI and analytics. Whether you’re a beginner or an experienced developer, this talk will give you the tools to simplify and enhance your data pipelines!

2023 (2)

ClickHouse: what is behind the fastest columnar database

at Devoxx · Oct 2023 · Antwerp, Belgium 🇧🇪

🎙 Olena Kutsenko, Senior Developer Advocate @Aiven 🔗 https://twitter.com/OlenaKutsenko ☑ Devoxx Ukraine 2023: https://devoxx.com.ua/

keynote

Using Apache Kafka and OpenSearch to explore Mastodon

at Devoxx · Jun 2023 · Antwerp, Belgium 🇧🇪

Apache Kafka is a powerful tool for connecting multiple systems, allowing data to flow across services and be reused for multiple purposes. This is useful in many scenarios, both for mission-critical applications and for fast data explorations. In this talk I’ll show one such data exploration. Mastodon, a microblogging tool, has been rising in popularity in recent months. If you have only recently joined Mastodon and are still exploring it, you might find that scrolling the timeline has its limits when it comes to understanding everything that is happening there. That being the case, applying some engineering skills will give a better overview of the topics and discussions on the platform. Since Mastodon's timeline is nothing more than a collection of continuously arriving events, its feed is well suited to Apache Kafka. Adding Kafka connectors on top of that opens up multiple opportunities to use the data for aggregations and visualizations. During this talk you'll learn how to bring data from Mastodon to Kafka using TypeScript and a couple of helpful libraries. Once the data is in the topic, we'll use Kafka Connect to bring it into OpenSearch and use it for search, aggregations, and visualizations. This talk is for both beginners in Apache Kafka and intermediate users. We'll use some more advanced concepts, but will keep it all simple, so that everyone can follow along and experiment with Mastodon data!

keynote

2022 (1)

Optimal Data Lake for analytics: Apache Kafka and ClickHouse