This talk explores building a complete self-hosted LLM stack in Rust: Paddler, a distributed load balancer for serving LLMs at scale, and Poet, a static site generator that consumes those LLMs for AI-powered content features.
We'll dive into the hard problems: async request routing across dynamic agent fleets, integrating with llama.cpp's C++ codebase, managing KV cache in custom slots, and implementing zero-to-N autoscaling with request buffering. You'll see how Rust's ownership model prevented entire classes of bugs in distributed state management, and walk away with concrete patterns for building and consuming LLM infrastructure in production.
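To make the request-buffering idea concrete, here is a minimal sketch of zero-to-N scaling on a tokio runtime: requests queue in a channel while the fleet is empty, the first arrival triggers an agent boot, and the backlog drains once it is up. All names (`Request`, `spawn_agent`) are hypothetical, not Paddler's actual API.

```rust
// Hypothetical sketch, not Paddler's API. Assumes tokio with the
// "full" feature set (rt, sync, time, macros).
use std::time::Duration;
use tokio::sync::mpsc;

#[derive(Debug)]
struct Request {
    prompt: String,
}

// Stand-in for booting a llama.cpp agent: returns a channel the
// balancer can route into once the agent is "up".
async fn spawn_agent(id: usize) -> mpsc::Sender<Request> {
    let (tx, mut rx) = mpsc::channel::<Request>(32);
    tokio::spawn(async move {
        while let Some(req) = rx.recv().await {
            // A real agent would run inference in one of its slots here.
            println!("agent {id} handling {:?}", req.prompt);
        }
    });
    tokio::time::sleep(Duration::from_millis(100)).await; // simulated boot
    tx
}

#[tokio::main]
async fn main() {
    // Incoming traffic buffers here while zero agents are running.
    let (tx, mut rx) = mpsc::channel::<Request>(128);
    let mut agents: Vec<mpsc::Sender<Request>> = Vec::new();

    let producer = tokio::spawn(async move {
        for i in 0..5 {
            tx.send(Request { prompt: format!("req {i}") }).await.unwrap();
        }
    });

    // Balancer loop: the first buffered request forces the zero -> one
    // scale-up; queued requests wait in the channel during the boot.
    while let Some(req) = rx.recv().await {
        if agents.is_empty() {
            agents.push(spawn_agent(agents.len()).await);
        }
        // With more agents, round-robin routing would go here.
        agents[0].send(req).await.unwrap();
    }

    producer.await.unwrap();
    tokio::time::sleep(Duration::from_millis(50)).await; // let the agent drain
}
```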
In this talk, we’ll re-create the core ideas of Karpathy’s micrograd, a tiny scalar-valued autograd engine, entirely in Rust.
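As a taste of what that involves, here is a minimal sketch of a micrograd-style engine, assuming `Rc<RefCell<...>>` for the shared graph nodes; it is illustrative only, not the talk's actual code.

```rust
// Illustrative sketch, not the talk's code: a micrograd-style `Value`
// holding data, an accumulated gradient, its children, and a closure
// that pushes its gradient down to those children.
use std::cell::RefCell;
use std::collections::HashSet;
use std::rc::Rc;

#[derive(Clone)]
struct Value(Rc<RefCell<Node>>);

struct Node {
    data: f64,
    grad: f64,
    prev: Vec<Value>,                   // children in the graph
    backward: Option<Box<dyn Fn(f64)>>, // given this node's grad, update children
}

impl Value {
    fn new(data: f64) -> Self {
        Value(Rc::new(RefCell::new(Node { data, grad: 0.0, prev: Vec::new(), backward: None })))
    }

    fn data(&self) -> f64 { self.0.borrow().data }
    fn grad(&self) -> f64 { self.0.borrow().grad }

    fn add(&self, other: &Value) -> Value {
        let out = Value::new(self.data() + other.data());
        let (a, b) = (self.clone(), other.clone());
        {
            let mut n = out.0.borrow_mut();
            n.prev = vec![a.clone(), b.clone()];
            // d(a+b)/da = d(a+b)/db = 1: the gradient passes through.
            n.backward = Some(Box::new(move |g| {
                a.0.borrow_mut().grad += g;
                b.0.borrow_mut().grad += g;
            }));
        }
        out
    }

    fn mul(&self, other: &Value) -> Value {
        let out = Value::new(self.data() * other.data());
        let (a, b) = (self.clone(), other.clone());
        {
            let mut n = out.0.borrow_mut();
            n.prev = vec![a.clone(), b.clone()];
            // Product rule: d(ab)/da = b and d(ab)/db = a.
            n.backward = Some(Box::new(move |g| {
                let (ad, bd) = (a.data(), b.data());
                a.0.borrow_mut().grad += bd * g;
                b.0.borrow_mut().grad += ad * g;
            }));
        }
        out
    }

    // Reverse-mode autodiff: topologically sort the graph, seed the
    // output's grad with 1.0, then run each node's closure in reverse.
    fn backward(&self) {
        fn build(v: &Value, topo: &mut Vec<Value>, seen: &mut HashSet<*const RefCell<Node>>) {
            if seen.insert(Rc::as_ptr(&v.0)) {
                for child in v.0.borrow().prev.iter() {
                    build(child, topo, seen);
                }
                topo.push(v.clone());
            }
        }
        let mut topo = Vec::new();
        let mut seen = HashSet::new();
        build(self, &mut topo, &mut seen);

        self.0.borrow_mut().grad = 1.0;
        for v in topo.iter().rev() {
            let g = v.0.borrow().grad;
            if let Some(f) = v.0.borrow().backward.as_ref() {
                f(g);
            }
        }
    }
}

fn main() {
    // d = a*b + a, so dd/da = b + 1 = -2.0 and dd/db = a = 2.0.
    let a = Value::new(2.0);
    let b = Value::new(-3.0);
    let d = a.mul(&b).add(&a);
    d.backward();
    println!("d = {}, da = {}, db = {}", d.data(), a.grad(), b.grad());
}
```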
This talk puts popular Rust rewrites to the test. We'll examine how these tools stack up against their battle-tested predecessors, looking at real-world performance, compilation times, binary sizes, feature completeness, and ecosystem maturity.
This talk explains how Rust debugging actually works: how compiler-generated debuginfo (DWARF/PDB) maps binaries back to source, and how LLDB/GDB interpret that data in practice.
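For orientation, here is a tiny stand-in program (not the talk's material); the comments note, under the usual non-MSVC toolchain assumptions, which pieces of debuginfo a debugger consults at each point.

```rust
// Built with `cargo build` (the dev profile enables debuginfo), rustc
// emits DWARF on non-MSVC targets (PDB on MSVC): .debug_line ties
// machine addresses to file:line pairs, and .debug_info describes
// types such as `Point` so debuggers can render locals.
#[derive(Debug)]
struct Point {
    x: f64,
    y: f64,
}

fn magnitude(p: &Point) -> f64 {
    // LLDB can stop on this line via
    //   breakpoint set --file main.rs --line <this line's number>
    // because the line table maps the source line to an address range.
    (p.x * p.x + p.y * p.y).sqrt()
}

fn main() {
    let p = Point { x: 3.0, y: 4.0 };
    // `frame variable p` reads the type description in .debug_info to
    // reconstruct the struct layout and print the fields.
    println!("{}", magnitude(&p));
}
```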
This talk explores how Rust’s type system and memory-safety guarantees can enforce mandatory guardrails at the infrastructure level, where traditional frameworks often fall short.
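One common shape for this is the newtype pattern sketched below, with hypothetical names (`SanitizedPrompt`, `send_to_model`): the guardrail is the type's only public constructor, so code that skips the check fails to compile rather than failing at runtime.

```rust
// Hypothetical sketch of a compile-time guardrail via the newtype pattern.
// The field is private, so outside this module the only way to obtain a
// `SanitizedPrompt` is through `sanitize`.
pub struct SanitizedPrompt(String);

#[derive(Debug)]
pub enum GuardrailError {
    Injection,
}

pub fn sanitize(raw: &str) -> Result<SanitizedPrompt, GuardrailError> {
    // Stand-in check; a real guardrail would be far richer.
    if raw.to_lowercase().contains("ignore previous instructions") {
        return Err(GuardrailError::Injection);
    }
    Ok(SanitizedPrompt(raw.trim().to_owned()))
}

// The model client accepts only the proof-carrying type, so the policy
// "sanitize before sending" is enforced by the type checker.
pub fn send_to_model(prompt: SanitizedPrompt) {
    println!("sending: {}", prompt.0);
}

fn main() {
    match sanitize("Summarize this document.") {
        Ok(p) => send_to_model(p),
        Err(e) => eprintln!("blocked: {e:?}"),
    }
    // Outside this module, constructing `SanitizedPrompt(..)` from raw
    // input would not compile: the constructor is private.
}
```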