Member-only story
New Panda..s has come
Let’s discuss one of the most widely used libraries in the field of data science — Pandas.
Pandas is a renowned library for data manipulation and analysis, widely used by data related people. Pandas provides powerful tools for handling and processing structured data.
Recently, the major version 2.0.0 of Pandas has been released with several new features and improvements.
In this article, we will be highlighting some of the main changes in the new version of Pandas. Although there are many small changes, we will be focusing only on the significant ones and highlight three of them.
- New Apache Arrow back-end
- Copy-on-write function
- Indexes
Apache Arrow
The recent introduction of a new Apache Arrow backend for pandas data marks an important change in the library. In the usual workflow, relevant data must first be loaded into memory before performing any operations in pandas. It is necessary to determine how the data will be stored in memory during this loading process.
Representing simple data types such as integers or floats is generally straightforward since the standard method of representation for a single item is commonly known, and we…