Wenqi Glantz – Medium

Wenqi Glantz

Pinned

An Overview of My Blog Posts

Learning by Blogging

Jul 26, 2023

Published in
TDS Archive

The Journey of RAG Development: From Notebook to Microservices

Converting a Colab notebook to two microservices with support for Milvus and NeMo Guardrails

Feb 21, 2024

Published in
TDS Archive

NeMo Guardrails, the Ultimate Open-Source LLM Security Toolkit

Exploring NeMo Guardrails’ practical use cases

Feb 9, 2024

Published in
TDS Archive

12 RAG Pain Points and Proposed Solutions

Solving the core challenges of Retrieval-Augmented Generation

Jan 30, 2024

Published in
TDS Archive

Jump-start Your RAG Pipelines with Advanced Retrieval LlamaPacks and Benchmark with Lighthouz AI

Exploring robust RAG development with LlamaPacks, Lighthouz AI, and Llama Guard

Jan 29, 2024

Published in
TDS Archive

Exploring mergekit for Model Merge, AutoEval for Model Evaluation, and DPO for Model Fine-tuning

My observations from experimenting with model merge, evaluation, and two model fine-tuning techniques

Jan 19, 2024

Published in
TDS Archive

Democratizing LLMs: 4-bit Quantization for Optimal LLM Inference

A deep dive into model quantization with GGUF and llama.cpp and model evaluation with LlamaIndex

Jan 15, 2024

Published in
TDS Archive

Deploying LLM Apps to AWS, the Open-Source Self-Service Way

A step-by-step guide to deploying LlamaIndex RAGs to AWS ECS Fargate

Jan 8, 2024

Published in
TDS Archive

Safeguarding Your RAG Pipelines: A Step-by-Step Guide to Implementing Llama Guard with LlamaIndex

How to add Llama Guard to your RAG pipelines to moderate LLM inputs and outputs and combat prompt injection

Dec 27, 2023

Published in
Level Up Coding

10+ Ways to Run Open-Source Models with LlamaIndex

LlamaIndex’s open-source model integration with Hugging Face, vLLM, Ollama, llama.cpp, LiteLLM, Replicate, Gradient, and more

Dec 19, 2023

Wenqi Glantz

Friend of Medium

Mom, wife, architect with a passion for technology and crafting quality products.
linkedin.com/in/wenqi-glantz-b5448a5a/
twitter.com/wenqi_glantz
