
How to Bulk Create and Update the Right Way in Golang (Part I)

Dũng Trung Trịnh · Published in Better Programming · 5 min read · Mar 21, 2020

I have recently stepped out of my comfort zone, which is Python, to get my hands dirty with the Golang project I’m currently working on at my company.

I’m writing this article to share the problems I ran into while creating and updating multiple records in PostgreSQL from Golang, because the ORM library I currently use (GORM) does not support bulk create and update operations yet.

In this article, we will work through different approaches for the above-mentioned operations and benchmark them. So, let’s get started.

Firstly, I need to create a new project folder containing all the source code:

mkdir go-bulk-create && cd go-bulk-create

Then, I initialize a new Go project with the go module:

go mod init github.com/TrinhTrungDung/go-bulk-create

We’re going to need the gorm dependency, so install it as follows:

go get -u github.com/jinzhu/gorm

I used Docker Compose to create a new isolated PostgreSQL database instance, named test.db, inside a container on my local machine:

  • Create a new file, docker-compose.yml, in the current directory.
touch docker-compose.yml
  • The content of the file should look like the sketch shown after this list.
  • Run the docker-compose file to create a new database:
docker-compose up -d
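A minimal compose file that brings up a throwaway PostgreSQL instance could look like the following sketch. The service name, image tag, credentials, and exposed port are example values of mine, so adjust them to match the connection string you use later in main.go:

```yaml
version: "3"

services:
  db:
    image: postgres:12-alpine   # any recent PostgreSQL image works
    environment:
      POSTGRES_USER: postgres
      POSTGRES_PASSWORD: postgres
      POSTGRES_DB: test         # database the benchmarks will connect to
    ports:
      - "5432:5432"             # expose PostgreSQL on the default port
```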

Now that we’ve got a new database, we create a new file to try all the approaches I’m talking about:

touch main.go
  • We need to connect to our database successfully first. The content of main.go will be along the lines of the sketch shown after this list.
  • We create a sample table named User and migrate it to the database.
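A minimal sketch of main.go could look like this; the DSN below simply reuses the example credentials from the compose file above and is not a requirement:

```go
package main

import (
	"log"

	"github.com/jinzhu/gorm"
	_ "github.com/jinzhu/gorm/dialects/postgres" // registers the postgres dialect
)

// User is the sample table we migrate to the database.
type User struct {
	gorm.Model
	Name     string
	Password string
}

func main() {
	// The DSN values must match the ones declared in docker-compose.yml.
	db, err := gorm.Open("postgres",
		"host=localhost port=5432 user=postgres password=postgres dbname=test sslmode=disable")
	if err != nil {
		log.Fatalf("failed to connect to database: %v", err)
	}
	defer db.Close()

	// Create (or update) the users table so it matches the User struct.
	db.AutoMigrate(&User{})
}
```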

Benchmarking

Let’s create the test file first:

touch main_test.go

The file contains some helper functions as below:
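As a sketch, main_test.go needs two things the benchmarks can share: a function that opens the database connection and a function that generates the column values for one row. The names connectDB and fakeUser below are my own; any equivalent helpers will do:

```go
package main

import (
	"fmt"
	"testing"

	"github.com/jinzhu/gorm"
	_ "github.com/jinzhu/gorm/dialects/postgres"
)

// connectDB opens a connection to the test database and stops the
// benchmark immediately if the database is unreachable.
func connectDB(b *testing.B) *gorm.DB {
	db, err := gorm.Open("postgres",
		"host=localhost port=5432 user=postgres password=postgres dbname=test sslmode=disable")
	if err != nil {
		b.Fatalf("failed to connect to database: %v", err)
	}
	return db
}

// fakeUser returns the (name, password) values for the i-th generated user.
func fakeUser(i int) (string, string) {
	return fmt.Sprintf("user-%d", i), fmt.Sprintf("password-%d", i)
}
```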

That’s all we need for the utilities. Let’s write the benchmark tests for each approach.

1. Insert multiple records using GORM

  • We add a benchmark test function for the ORM bulk create to the main_test.go file (a sketch follows the results below):
  • Run the below command to get the benchmark for speed and memory performance:
go test -benchmem -run=^$ github.com/TrinhTrungDung/go-bulk-create -bench BenchmarkOrmCreate
  • Test result:
Test result for bulk-create by ORM
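For reference, here is a sketch of the benchmark itself, built on the connectDB and fakeUser helpers above. Since jinzhu/gorm has no bulk-create API, every record costs its own INSERT and round trip:

```go
// BenchmarkOrmCreate inserts one record per iteration through GORM,
// because jinzhu/gorm has no bulk-create API.
func BenchmarkOrmCreate(b *testing.B) {
	db := connectDB(b)
	defer db.Close()

	b.ResetTimer()
	for i := 0; i < b.N; i++ {
		name, password := fakeUser(i)
		if err := db.Create(&User{Name: name, Password: password}).Error; err != nil {
			b.Fatal(err)
		}
	}
}
```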

Since the benchmark runs for one second by default, the function creates only 240 records, at roughly 5.5 ms per iteration, allocating about 9000 B and making 105 distinct memory allocations per iteration.

Pretty bad for a larger system…

2. Insert multiple records by natively using the INSERT statement

  • We add another bulk-create benchmark, this time using a raw INSERT statement, to the main_test.go file (a sketch follows the results below):
  • Run the test function with the command below:
go test -benchmem -run=^$ github.com/TrinhTrungDung/go-bulk-create -bench BenchmarkCreate
  • Test results:
Test benchmark got errors
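A sketch of this benchmark is below: each iteration only appends one value tuple, and a single INSERT carrying all b.N rows is executed at the end. It reuses the helpers above and also needs the strings package imported in main_test.go:

```go
// BenchmarkCreate appends one (name, password) pair per iteration and then
// executes a single INSERT statement containing every row at once.
func BenchmarkCreate(b *testing.B) {
	db := connectDB(b)
	defer db.Close()

	valueStrings := make([]string, 0, b.N)
	valueArgs := make([]interface{}, 0, b.N*2)

	b.ResetTimer()
	for i := 0; i < b.N; i++ {
		name, password := fakeUser(i)
		// PostgreSQL placeholders are numbered $1, $2, ...
		valueStrings = append(valueStrings, fmt.Sprintf("($%d, $%d)", i*2+1, i*2+2))
		valueArgs = append(valueArgs, name, password)
	}

	stmt := fmt.Sprintf("INSERT INTO users (name, password) VALUES %s",
		strings.Join(valueStrings, ","))
	if err := db.Exec(stmt, valueArgs...).Error; err != nil {
		b.Fatal(err)
	}
}
```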

What? You must be wondering why we got errors like this while benchmarking the above function.

If we read the error in the console, we notice that the maximum number of parameters PostgreSQL supports in a single statement is 65535. Each iteration appends two more parameters (name and password), so the number of iterations must be at most 65535/2 = 32767.

However, you can see a significant improvement in terms of speed and memory. In one second, it generates around 150000 records at 8354 ns per iteration, which is over 600 times faster than the first approach!

Additionally, it cuts the number of bytes and memory allocations per iteration by roughly a factor of ten. That’s a huge improvement, but there’s no guarantee of data consistency if we implement this approach in the real world.

3. Insert multiple records using the INSERT statement with batching size

Let’s see whether this approach can rescue the second one.

  • First, add a new bulk-create benchmark just like the BenchmarkCreate function to the main_test.go file. This time, however, we split the list of values into smaller chunks with a batch size of 500 (a sketch follows the results below):
  • Run the test function with the command below:
go test -benchmem -run=^$ github.com/TrinhTrungDung/go-bulk-create -bench BenchmarkBulkCreate
  • Test result:
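Here is a sketch of the batched version. It is the same raw INSERT as before, except the rows are flushed every 500 values, which keeps each statement far below the 65535-parameter limit. The insertChunk helper name is my own, introduced so the next approach can reuse it:

```go
// insertChunk inserts users from index `from` up to (but not including) `to`
// with a single INSERT statement.
func insertChunk(b *testing.B, db *gorm.DB, from, to int) {
	valueStrings := make([]string, 0, to-from)
	valueArgs := make([]interface{}, 0, (to-from)*2)
	for i := from; i < to; i++ {
		name, password := fakeUser(i)
		n := len(valueArgs)
		valueStrings = append(valueStrings, fmt.Sprintf("($%d, $%d)", n+1, n+2))
		valueArgs = append(valueArgs, name, password)
	}

	stmt := fmt.Sprintf("INSERT INTO users (name, password) VALUES %s",
		strings.Join(valueStrings, ","))
	if err := db.Exec(stmt, valueArgs...).Error; err != nil {
		b.Fatal(err)
	}
}

// BenchmarkBulkCreate issues one INSERT statement per chunk of 500 rows.
func BenchmarkBulkCreate(b *testing.B) {
	db := connectDB(b)
	defer db.Close()

	const batchSize = 500

	b.ResetTimer()
	for i := 0; i < b.N; i += batchSize {
		end := i + batchSize
		if end > b.N {
			end = b.N
		}
		insertChunk(b, db, i, end)
	}
}
```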

OK, it might perform a little worse than the second approach, but it now guarantees consistency when inserting new records, which is the most important thing to think about first.

Right now, you probably have questions like: “What happens when we vary the batch size or insert more columns?” That brings us to the last approach.

4. Insert multiple records using the INSERT statement with dynamic batching size

  • First, we refactor the benchmarkBulkCreate function to accept another parameter named size (a sketch follows the results below).
  • We then benchmark the benchmarkBulkCreate function with batch sizes of 1, 100, 500, and 1000.
  • Run the benchmarks for the different batch sizes with the command below:
go test -benchmem -run=^$ github.com/TrinhTrungDung/go-bulk-create -bench BenchmarkBulkCreateSize
  • Test results:
Benchmark testing with different batch sizes
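The refactor can be sketched as a lowercase benchmarkBulkCreate that takes the batch size, plus one exported wrapper per size so each run shows up as its own benchmark (the wrapper names here are my own, chosen to match the -bench pattern above):

```go
// benchmarkBulkCreate is the batched insert from approach 3 with the
// hard-coded batch size replaced by the size parameter.
func benchmarkBulkCreate(b *testing.B, size int) {
	db := connectDB(b)
	defer db.Close()

	b.ResetTimer()
	for i := 0; i < b.N; i += size {
		end := i + size
		if end > b.N {
			end = b.N
		}
		insertChunk(b, db, i, end)
	}
}

// One exported wrapper per batch size.
func BenchmarkBulkCreateSize1(b *testing.B)    { benchmarkBulkCreate(b, 1) }
func BenchmarkBulkCreateSize100(b *testing.B)  { benchmarkBulkCreate(b, 100) }
func BenchmarkBulkCreateSize500(b *testing.B)  { benchmarkBulkCreate(b, 500) }
func BenchmarkBulkCreateSize1000(b *testing.B) { benchmarkBulkCreate(b, 1000) }
```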

There’s no difference in the number of records created per second, nor in the creation speed, once we make the batch size greater than 100. The same applies to the number of allocated bytes and memory allocations per iteration.

Let’s see what happens if we change the number of insert parameters. To do that, we simply change the helper function as below:
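The specific extra columns are up to you; as one example, suppose each user also carries an email, so every row contributes three insert parameters instead of two. The fakeUser helper could then grow a third return value (the User model needs a matching Email field, and insertChunk needs three placeholders per row):

```go
// fakeUser now returns three values, so every row carries three insert
// parameters instead of two. Remember to add an Email field to User,
// re-run AutoMigrate, and update insertChunk accordingly.
func fakeUser(i int) (string, string, string) {
	return fmt.Sprintf("user-%d", i),
		fmt.Sprintf("password-%d", i),
		fmt.Sprintf("user-%d@example.com", i)
}
```

Note that with three parameters per row, the hard ceiling from approach 2 also drops to 65535/3 = 21845 rows per statement, which is another reason batching is the safer default.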

  • Test results:
Benchmark testing with different batch sizes and additional parameters

Well, adding more parameters seems to make no difference when comparing the stats across batch sizes.

Conclusion

I personally choose the INSERT statement with a default batch size of 500, since it is stable across all use cases. In the next part, I will benchmark the bulk update operation.

If you have a better solution, please share it with me in the comment section.


Written by Dũng Trung Trịnh

Backend Engineer @ Buuuk. I use #Python #Golang as core programming languages 🚀
