Apache Hudi: Large Scale Data Systems with Vinoth Chandar


Apache Hudi is an open-source data management framework used to simplify incremental data processing and data pipeline development. The framework helps manage business requirements such as data lifecycle more efficiently and improves data quality. Some common use cases for Hudi are record-level inserts, updates, and deletes; simplified file management and near real-time data access; and simplified CDC data pipeline development (AWS.amazon.com).

In this episode we speak to Vinoth Chandar, VP of Apache Hudi. Vinoth created the Hudi project at Uber and continues to lead its evolution at the Apache Software Foundation. Previously he was a Principal Engineer at Confluent, and a Sr Staff Engineer/Manager at Uber before that. We discuss building large-scale distributed systems and data systems.

Sponsorship inquiries: sponsor@softwareengineeringdaily.com

Transcript

Transcript provided by We Edit Podcasts. Software Engineering Daily listeners can go to weeditpodcasts.com to get 15% off the first three months of audio editing and transcription services with code: SED. Thanks to We Edit Podcasts for partnering with SE Daily.

Sponsors

Triplebyte is a network of 200,000+ top engineers. Triplebyte works with more than 400 tech companies including Coinbase, Zoox, Dropbox, and Facebook. Triplebyte is focused on matching high-quality engineers with great jobs. Let the right roles come to you. Want to know your strengths? Take the Triplebyte quiz and receive your personalized feedback report. Tracks offered: Generalist, Front End, Mobile, Machine Learning, DevOps, Data Science, and Entry Level. Visit triplebyte.com/sedaily.

Today’s podcast is brought to you by Google Cloud and the DORA research team. The team recently launched a survey to collect insights for the 2021 State of DevOps report and would love your input! The State of DevOps report is the largest and longest-running research of its kind, providing insight into how we can improve software delivery performance with DevOps. By completing the survey, you get to shape the conversation on DevOps along with the more than 30,000 software professionals who took the survey over the past six years. So what are you waiting for? Take the survey at cloud.google.com/devops!

Oracle wants to help you land those big customers, so they’re offering preferred pricing on enterprise cloud for startups. Free cloud credits and 70% off their cloud services, and with multi-cloud support and no vendor lock-in, you can build it out any way you want. Oracle for Startups doesn’t want you wheezing on the side of the road. They want you to have enough power to scale and land your dream customer. Visit oracle.com/go/sedaily.

Cox Automotive transforms the way that the world buys, sells, owns, and uses cars – and they have the data to understand how the world gets from point A to point B. With brands like Kelley Blue Book, Autotrader, Dealer.com and others, Cox Automotive is hiring software engineers, data scientists, scrum masters, and other technologists to help create meaningful change in the industry. If you want to innovate in a collaborative workplace, one that values your time and work-life balance, then visit COXAUTOTECH.COM to find career opportunities at Cox.