Building a self-hosted LLM ecosystem in Rust: from infrastructure to applications

This talk explores building a complete self-hosted LLM stack in Rust: Paddler, a distributed load balancer for serving LLMs at scale, and Poet, a static site generator that consumes those LLMs for AI-powered content features.

Mateusz Charytoniuk
Software Architect
About This Talk

We'll dive into the hard problems: async request routing across dynamic agent fleets, integrating with llama.cpp's C++ codebase, managing KV cache in custom slots, and implementing zero-to-N autoscaling with request buffering. You'll see how Rust's ownership model prevented entire classes of bugs in distributed state management, and walk away with concrete patterns for building and consuming LLM infrastructure in production.
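To make the zero-to-N buffering idea concrete, here is a minimal, synchronous sketch of slot-aware routing with a bounded request buffer. It is not Paddler's actual implementation: `Balancer`, `Dispatch`, `Request`, `AgentId`, and every method name are hypothetical, and a production balancer would be async and would track slot state reported by the agents themselves. The sketch assumes each agent advertises a fixed number of slots, routes a request to the agent with the most free slots, and holds requests in a FIFO buffer while no slot is free (including when no agent is running at all).

```rust
use std::cmp::Reverse;
use std::collections::{HashMap, VecDeque};

/// Hypothetical identifier for an agent (one process wrapping an LLM runtime).
pub type AgentId = u32;

#[derive(Debug, Clone, PartialEq, Eq)]
pub struct Request {
    pub id: u64,
    pub prompt: String,
}

#[derive(Debug, PartialEq, Eq)]
pub enum Dispatch {
    /// The request was assigned to a free slot on `agent`.
    Routed { request_id: u64, agent: AgentId },
    /// No slot was free, so the request waits in the buffer.
    Buffered { request_id: u64 },
    /// The buffer is full; the caller should reject or retry later.
    Rejected { request_id: u64 },
}

struct AgentState {
    slots_total: usize,
    slots_busy: usize,
}

pub struct Balancer {
    agents: HashMap<AgentId, AgentState>,
    buffer: VecDeque<Request>,
    buffer_capacity: usize,
}

impl Balancer {
    pub fn new(buffer_capacity: usize) -> Self {
        Self { agents: HashMap::new(), buffer: VecDeque::new(), buffer_capacity }
    }

    /// Adds an agent to the fleet and hands its slots to buffered requests.
    pub fn register_agent(&mut self, id: AgentId, slots_total: usize) -> Vec<Dispatch> {
        self.agents.insert(id, AgentState { slots_total, slots_busy: 0 });
        self.drain_buffer()
    }

    /// Removes an agent, for example after a scale-down.
    pub fn deregister_agent(&mut self, id: AgentId) {
        self.agents.remove(&id);
    }

    /// Routes the request if a slot is free, otherwise buffers or rejects it.
    pub fn submit(&mut self, request: Request) -> Dispatch {
        if let Some(agent) = self.take_slot() {
            return Dispatch::Routed { request_id: request.id, agent };
        }
        if self.buffer.len() < self.buffer_capacity {
            let request_id = request.id;
            self.buffer.push_back(request);
            Dispatch::Buffered { request_id }
        } else {
            Dispatch::Rejected { request_id: request.id }
        }
    }

    /// Frees one slot on `agent`, then serves the oldest buffered requests.
    pub fn release_slot(&mut self, agent: AgentId) -> Vec<Dispatch> {
        if let Some(state) = self.agents.get_mut(&agent) {
            state.slots_busy = state.slots_busy.saturating_sub(1);
        }
        self.drain_buffer()
    }

    pub fn buffered(&self) -> usize {
        self.buffer.len()
    }

    /// Claims a slot on the agent with the most free slots (lowest id on ties).
    fn take_slot(&mut self) -> Option<AgentId> {
        let (id, state) = self
            .agents
            .iter_mut()
            .filter(|(_, s)| s.slots_busy < s.slots_total)
            .max_by_key(|(id, s)| (s.slots_total - s.slots_busy, Reverse(**id)))?;
        state.slots_busy += 1;
        Some(*id)
    }

    /// Moves buffered requests into free slots, oldest first.
    fn drain_buffer(&mut self) -> Vec<Dispatch> {
        let mut routed = Vec::new();
        while !self.buffer.is_empty() {
            let Some(agent) = self.take_slot() else { break };
            if let Some(request) = self.buffer.pop_front() {
                routed.push(Dispatch::Routed { request_id: request.id, agent });
            }
        }
        routed
    }
}
```

With zero agents registered, `submit` buffers requests until `buffer_capacity` is reached and rejects the rest. Calling `register_agent` for the first agent then drains the buffer into its slots. Because all state sits behind `&mut self`, the borrow checker rules out two callers mutating the slot counts at once; sharing the balancer across threads would need an explicit `Mutex` or a message channel.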

more great talks

Might Be Interesting

Day 2, 12:25 pm

Clean Code for Good Science: Rust in Research and Health

This talk explores what it means to write scientific software that lives up to the standards we expect of science itself.

Day 2, 2:35 pm

Rust metaprogramming - macro_rules! beyond basics

I'll share a few tricks to help you write cleaner, more powerful declarative macros. You'll also get a sneak peek at nightly features to see what's coming next in the macro_rules! world.

Day 1, 5:20 pm

There are rats in my Cargo!!!

In this introductory talk, we will explore what it means to "Ratatuify" the Rust package manager, Cargo.

Day 1, 3:10 pm

From Micrograd to coppergrad: Building Neural Networks and Backpropagation from Scratch in Rust

In this talk, we’ll re-create the core ideas of Karpathy’s micrograd, but entirely in Rust.

Day 2, 11:15 am

Rust performance debugging with TUIs and LLMs

In my session, I will present the https://hotpath.rs crate and explain how it compares to other profiling tools available.

Day 1, 2:35 pm

Blazingly Fast or Blazingly Hyped? A Reality Check on the RIIR Movement

This talk puts popular Rust rewrites to the test. We'll examine how these tools stack up against their battle-tested predecessors, looking at real-world performance, compilation times, binary sizes, feature completeness, and ecosystem maturity.

Join us!

We're looking for amazing speakers.
CFP is open till 10.01.2023

Location

Centrum Konferencyjne POLIN, Poland