Columnar Data: Apache Arrow and Parquet with Julien Le Dem and Jacques Nadeau
Column-oriented data storage allows us to access all of the entries in a database column quickly and efficiently. Columnar storage formats are mostly relevant today for performing large analytics jobs. For example, if you are a bank, and you want to get the sum of all of the financial transactions that took place on your system in the last week, you donβt want to iterate through every row in a
Continue reading...