Better Programming

Advice for programmers.

Follow publication

Member-only story

New Panda..s has come

Mark Eltsefon
Better Programming
Published in
4 min readApr 22, 2023

--

Photo by Quentin Grignet on Unsplash

Pandas is a renowned library for data manipulation and analysis, widely used by data related people. Pandas provides powerful tools for handling and processing structured data.

Recently, the major version 2.0.0 of Pandas has been released with several new features and improvements.

In this article, we will be highlighting some of the main changes in the new version of Pandas. Although there are many small changes, we will be focusing only on the significant ones and highlight three of them.

  1. New Apache Arrow back-end
  2. Copy-on-write function
  3. Indexes

Apache Arrow

The recent introduction of a new Apache Arrow backend for pandas data marks an important change in the library. In the usual workflow, relevant data must first be loaded into memory before performing any operations in pandas. It is necessary to determine how the data will be stored in memory during this loading process.

Representing simple data types such as integers or floats is generally straightforward since the standard method of representation for a single item is commonly known, and we…

--

--

Mark Eltsefon
Mark Eltsefon

Written by Mark Eltsefon

A data scientist focusing on experimentation and constantly seeking new knowledge.

Write a response