Search Results for: kafka

Great Expectations: Data Pipeline Testing with Abe Gong

A data pipeline is a series of steps that takes large data sets and creates usable results from them. At the beginning of a data pipeline, a data set might be pulled from a database, a

What is a Layer 2 Cloud Provider?

The rise of “cloud infrastructure” has presented a dilemma for developers: what is the appropriate level of complexity for a cloud provider to handle? In the last decade, the options

The Data Exchange with Ben Lorica

Data infrastructure has been transformed over the last fifteen years.  The open source Hadoop project led to the creation of multiple companies based around commercializing the

Presto with Justin Borgman

A data platform contains all of the data that a company has accumulated over the years. Across a data platform, there is a multitude of data sources: databases, a data lake, data

Nubank Data Engineering with Sujith Nair

Nubank is a popular bank that is based in Brazil. Nubank has more than 20 million customers, and has accumulated a high volume of data over the six years since it was started. Mobile