Optimize Python Code With Scalene and AI Suggestions

Make your code faster and reduce memory usage with the Scalene profiler

Dora Lourenço, MSc
Better Programming

--


Python is often used with libraries written in other languages behind the scenes. With this level of abstraction, it can be hard to figure out where to improve performance and memory usage. This is exactly the problem a profiler solves.

A profiler finds which code sections take the longest to run or use the most memory. Scalene is a great Python profiler that targets CPU, GPU, and memory. Combined with AI suggestions, Scalene also helps you refactor the problematic sections faster.

How To Use Scalene

To run Scalene, use the command scalene program_name.py. It profiles CPU, GPU, and memory by default. If you only want a subset of these, use the flags --cpu, --gpu, and --memory. For example, scalene --cpu --gpu program_name.py profiles only CPU and GPU.
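
Putting that together, typical invocations look like this (program_name.py stands in for your own script):

# Profile CPU, GPU, and memory (the default)
scalene program_name.py

# Profile only memory
scalene --memory program_name.py

# Profile only CPU and GPU
scalene --cpu --gpu program_name.py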

Besides line-level profiling, Scalene also provides function-level profiling. The two types are kept in separate sections of the output table: the first section profiles every line, while the second profiles every function. To keep only the lines and functions with significant usage, add the flag --reduced-profile.
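
To see the function-level section in action, here is a hypothetical variant in which the work is wrapped in functions (the file name functions.py and the function names are placeholders, not part of the original example). Running scalene --reduced-profile functions.py then reports only the lines and functions with significant usage.

# functions.py
def allocate(size):
    # Appears as its own row in the function-level section
    return [i for i in range(size)]

def square_all(values):
    # The heavy loop here dominates the CPU time
    return [v * v for v in values]

data = allocate(1000000)
squared = square_all(data)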

Interfaces

After you run the profiling command, Scalene shows the results in an interface. You have two options: the Command Line Interface (CLI) and the web interface. To compare them, we will use the following Python file, called test.py.

size = 1000000

# High memory allocation
x = [i for i in range(size)]
y = [i for i in range(size)]

# High computation time
for i in range(size):
    y[i] = y[i] * y[i]

Command line interface

By default, the command scalene test.py will open the web interface. To obtain the CLI instead, add the flag --cli.
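
For the example file, that is simply:

scalene --cli test.py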

The table uses three colors. Blue indicates CPU profiling, green indicates memory profiling, and yellow indicates GPU profiling and copy volume.

CPU profiling gives the time spent running Python code, native code (for example, C or C++), and the time spent in the system (for example, I/O). In the example, 45% of the total running time is spent on Python code on the line y[i] = y[i] * y[i]. As such, this is one of the lines we must optimize to increase performance. If you sum all the percentages in the blue columns, you get 100%.
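
As a rough sketch of where each category comes from (this snippet is not part of the original example and assumes NumPy is installed), the three blocks below are dominated by Python, native, and system time, respectively:

import numpy as np

# Mostly Python time: an interpreted loop
total = 0
for i in range(1000000):
    total += i * i

# Mostly native time: the work happens inside NumPy's compiled code
a = np.random.rand(2000, 2000)
b = a @ a

# Mostly system time: file I/O
with open("output.bin", "wb") as f:
    f.write(b.tobytes())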

Memory profiling gives the percentage of the memory allocated by Python code. The table also includes the memory usage over time and its peak. As expected, the creation of the x and y lists leads to the highest memory allocation. To improve performance, we should create them with more memory-efficient allocations.
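
To illustrate what a more efficient allocation can look like, a NumPy array stores the integers in one contiguous buffer instead of a list of separate int objects. The comparison below is only a sketch; the exact byte counts depend on your platform and NumPy's default integer dtype.

import sys
import numpy as np

size = 1000000

# Python list: every element is a separate int object
x_list = [i for i in range(size)]

# NumPy array: one contiguous block of fixed-size integers
x_array = np.arange(size)

print(sys.getsizeof(x_list))  # size of the list structure itself
print(x_array.nbytes)  # size of the array's data buffer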

GPU profiling and copy volume give the GPU running time and the volume of data copied (in MB/s), respectively. The copy volume includes copies between the GPU and the CPU. Note that GPU profiling only supports NVIDIA GPUs.
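
If you want something for these columns to measure, a small PyTorch sketch like the one below (PyTorch and a CUDA-capable NVIDIA GPU are assumed; this is not part of the original example) triggers both GPU time and copies between the GPU and the CPU:

import torch

# Copy data from the CPU to the GPU (counts toward copy volume)
x = torch.rand(4096, 4096).to("cuda")

# Work on the GPU (counts toward GPU time)
y = x @ x

# Copy the result back to the CPU (more copy volume)
z = y.cpu()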

Web interface

The web interface is quite similar to the CLI. However, some columns are compacted using color shades. For example, there is only one blue column (for CPU profiling), with three shades representing Python, native, and system time.

Memory and GPU profiling have extra columns. Memory profiling gains a column indicating the average memory usage, and the memory activity shows the memory allocated by Python and native code, differentiated by two shades of green. GPU profiling gains a column indicating GPU memory usage.

Unlike the CLI, the web interface also creates two extra files, profile.html and profile.json, which contain the results shown. If you wish to obtain these files with the CLI, use the flags --json and --html.
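
For example, with the test.py file from above (output details may vary between Scalene versions):

# Produce the JSON results from the CLI
scalene --json test.py

# Produce the HTML results from the CLI
scalene --html test.py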

AI Suggestions

So far, the tools at our disposal have helped us determine which lines and functions to improve. However, we can make our work even faster by generating AI suggestions instead of coming up with them ourselves. Fortunately, Scalene can be used with OpenAI, given an API key.

To get an API key, first sign in to your OpenAI account or create one. Then, click Personal in the top right corner of the screen and choose View API keys. On that page, you can generate a new API key and copy it into the advanced options of the Scalene web interface.

There are two types of suggestions you can choose from. The explosion symbol 💥 proposes optimizations for an entire region of code, while the lightning bolt ⚡ proposes an optimization for a single line. In the following image, you can see the lightning bolt suggestions for test.py, which mostly consist of replacements using NumPy.

The optimized version of test.py becomes:

import numpy as np

size = 1000000

# Vectorized allocation replaces the list comprehensions
x = np.arange(size)
y = np.arange(size)

# Vectorized squaring replaces the explicit loop
y = y ** 2
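
To confirm that the refactored version actually pays off, a quick timing sketch like the one below (standard library plus NumPy; the exact numbers will vary by machine) compares the two approaches:

import time
import numpy as np

size = 1000000

# Original approach: Python lists and an explicit loop
start = time.perf_counter()
y_list = [i for i in range(size)]
for i in range(size):
    y_list[i] = y_list[i] * y_list[i]
print("list version:", time.perf_counter() - start, "seconds")

# Optimized approach: vectorized NumPy operations
start = time.perf_counter()
y_array = np.arange(size)
y_array = y_array ** 2
print("numpy version:", time.perf_counter() - start, "seconds")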
