New Panda..s has come

Let’s discuss one of the most widely used libraries in the field of data science — Pandas.

Mark Eltsefon

Published in Better Programming

4 min read · Apr 22, 2023

Pandas is a well-known library for data manipulation and analysis, widely used by data scientists, analysts, and engineers. It provides powerful tools for handling and processing structured data.

Pandas 2.0.0, a new major version, was recently released with several new features and improvements.

In this article, we will look at the main changes in the new version of Pandas. There are many small changes, but we will focus on three significant ones:

New Apache Arrow back-end
Copy-on-write
Indexes

Apache Arrow

The new Apache Arrow backend for pandas data is an important change in the library. In the usual workflow, data must first be loaded into memory before pandas can perform any operations on it. During this loading step, pandas has to decide how the data will be stored in memory.
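To make the storage decision concrete, here is a minimal sketch that loads the same small CSV twice: once with the default NumPy-backed dtypes and once with the Arrow backend. It assumes pandas 2.0 or later, where `read_csv` accepts `dtype_backend="pyarrow"`, and it assumes the optional `pyarrow` package is installed. The sample data and the `load_with_backend` helper are hypothetical and exist only for illustration.

```python
import io

import pandas as pd

# Hypothetical sample data; the second row has a missing population value.
CSV_TEXT = "city,population\nOslo,709000\nBergen,\n"


def load_with_backend(csv_text, dtype_backend=None):
    """Read CSV text, optionally asking pandas for a specific dtype backend."""
    kwargs = {} if dtype_backend is None else {"dtype_backend": dtype_backend}
    return pd.read_csv(io.StringIO(csv_text), **kwargs)


# Default backend: the missing integer forces the column to float64 (NaN).
default_df = load_with_backend(CSV_TEXT)
print(default_df.dtypes)

# Arrow backend: columns are stored as Arrow-backed extension arrays,
# so the missing value is kept as a null instead of changing the dtype to float.
try:
    import pyarrow  # noqa: F401  (optional dependency, assumed installed)

    arrow_df = load_with_backend(CSV_TEXT, dtype_backend="pyarrow")
    print(arrow_df.dtypes)
except ImportError:
    arrow_df = None
    print("pyarrow is not installed; skipping the Arrow-backed load")
```

Comparing the two `dtypes` printouts shows that the in-memory representation is chosen at load time, which is the decision the Arrow backend changes.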

Representing simple data types such as integers or floats is generally straightforward since the standard method of representation for a single item is commonly known, and we…
