The Emergence of the LLM Tech Stack
Exciting developments ahead.

The generative AI sphere is advancing at lightning speed. It almost makes you miss the JavaScript days, when something new came out and it was "just one more framework." Today (April 2023), we are watching a stack form, solidify, and take shape. Here is what is emerging on the scene:
Storage Layer
Compared to other stacks, this is the equivalent of the database layer. What makes it special is a new paradigm: the vector database. Without going into much detail, this kind of database converts words into numeric vectors (embeddings) that serve as "nearest neighbor" indexes, making it possible to measure how similar objects are to one another or to a search query. It is used for semantic search; as the current state of the art, the Pinecone database combined with OpenAI is an excellent tool for the job, and an example can be found in the documentation.
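To make the "nearest neighbor" idea concrete, here is a minimal sketch in plain Python. The toy three-dimensional vectors stand in for real embeddings (which in practice would come from something like OpenAI's embedding endpoint and be stored in a vector database such as Pinecone); the `semantic_search` function and the example data are hypothetical, for illustration only.

```python
import math

def cosine_similarity(a, b):
    """Cosine of the angle between two vectors: 1.0 means identical direction."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm

# Toy "embeddings" -- real ones have hundreds or thousands of dimensions.
documents = {
    "cat": [0.9, 0.1, 0.0],
    "dog": [0.8, 0.2, 0.1],
    "car": [0.0, 0.1, 0.9],
}

def semantic_search(query_vector, top_k=2):
    """Return the top_k document keys closest to the query vector."""
    scored = sorted(
        documents.items(),
        key=lambda item: cosine_similarity(query_vector, item[1]),
        reverse=True,
    )
    return [name for name, _ in scored[:top_k]]

# A query vector near "cat" and "dog" ranks them above "car".
print(semantic_search([0.85, 0.15, 0.05]))
```

A real vector database does essentially this, but with approximate-nearest-neighbor indexes so the search scales to millions of vectors.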
Model Layer
This is the layer where you choose the Large Language Model (LLM). Everything here should be model-agnostic: you always have embeddings that use the layer below to access the data, and the same goes for fine-tuning. Currently, OpenAI dominates the scene, but in the future, open-source models from Meta or Stability AI could take up some share. The crucial aspect is that this layer needs to be plug-and-play.
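The plug-and-play requirement can be sketched as a thin provider-agnostic interface. Everything below is hypothetical illustration code, not any vendor's actual API: the point is only that swapping OpenAI for an open-source model should be a one-line change.

```python
from abc import ABC, abstractmethod

class LLM(ABC):
    """Provider-agnostic model interface (illustrative, not a real library)."""

    @abstractmethod
    def complete(self, prompt: str) -> str:
        ...

class OpenAIModel(LLM):
    def complete(self, prompt: str) -> str:
        # A real implementation would call the OpenAI API here.
        return f"[openai] {prompt}"

class LocalModel(LLM):
    def complete(self, prompt: str) -> str:
        # A self-hosted open-source model would be invoked here instead.
        return f"[local] {prompt}"

def answer(model: LLM, question: str) -> str:
    """Application code depends only on the interface, not the provider."""
    return model.complete(question)

# Swapping providers touches a single line:
print(answer(OpenAIModel(), "Hello"))
print(answer(LocalModel(), "Hello"))
```

The application code above never changes when the model behind it does, which is exactly the property the model layer needs.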
Service/Chain Layer
If I had to compare this layer with anything, it would be something close to Kubernetes or Terraform: the high-level part that combines all the other components.
Chain architecture, particularly the one developed by LangChain, should become the industry standard in my opinion. It’s quite new, but the idea is solid, and the integration of all concepts looks impressive. It utilizes the layers below and introduces other concepts and patterns, like Agents, to make an entire application work.
It also depends on how low-level this layer turns out to be. This is where OpenAI plugins live as well, each of which will be an end-user app in itself. Depending on what you need as an end consumer, plugins like Zapier may be more than enough.
This will be the most crucial layer, regardless of future developments. To use an analogy, LangChain is to OpenAI what Terraform is to AWS or Azure.
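The chain pattern described above can be sketched in a few lines: each step's output feeds the next step's input, tying the storage and model layers together. The function names here are hypothetical stand-ins, not the LangChain API.

```python
def retrieve_context(question: str) -> str:
    # Step 1: a real chain would query the storage layer (vector database).
    return f"context for: {question}"

def build_prompt(question: str, context: str) -> str:
    # Step 2: template the prompt with the retrieved context.
    return f"Answer using [{context}]: {question}"

def call_model(prompt: str) -> str:
    # Step 3: a real chain would call the model layer here.
    return f"response({prompt})"

def run_chain(question: str) -> str:
    """Compose the steps: retrieval -> prompting -> model call."""
    context = retrieve_context(question)
    prompt = build_prompt(question, context)
    return call_model(prompt)

print(run_chain("What is a vector database?"))
```

Agents extend this same idea by letting the model itself decide which step (or tool) to run next, rather than following a fixed sequence.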
UI Layer
This is the final layer, the presentation layer if you will. The closest comparison would be Power BI or Tableau.
For LangChain, Flow Chain is now emerging, and it looks amazing; the repository is worth a look.

Other contenders
There are always other solutions, usually "SaaS for profit" oriented, which bring their own strengths, particularly in the enterprise-grade world. One of them is Microsoft, and the image below says it all:

Microsoft Graph is a dear tool to all .NET developers, myself included. This is just a small sample of what has been showcased as Office Copilot, along with other "one tool to conquer them all" approaches, like Power BI or Power Apps.
Of course, Google will attempt the same thing, but let’s see how it goes.
Some things stay as they are
The lingua franca is still Python, and other user-friendly tools remain the same, such as Jupyter Notebook and Google Colab.
Final Thoughts
The stack is emerging, and the title "prompt engineer" may not stay as charming as it seems today. In my humble opinion, everything is gravitating towards a "distro" of data science. And it looks amazing.