Understanding Conversational AI
The interface powering our local LLM research. Exploring how conversational AI works, the architectural decisions behind local inference, and why data processing should stay local.
Our deployment of OpenWebUI serves as the primary interface for studying locally hosted language models. By avoiding cloud endpoints, we sidestep the telemetry and response filtering inherent in commercial APIs, allowing us to study unfiltered model behaviors and response patterns.
This architecture fundamentally changes the security model: when the weights run locally on consumer-grade hardware (an NVIDIA RTX-series GPU), prompts and documents never leave the machine, and the attack surface shrinks to the local network perimeter.
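In practice, "local" means every inference request targets a loopback or LAN address rather than a vendor's cloud API. A minimal sketch of one chat request, assuming OpenWebUI's OpenAI-compatible endpoint on its default port (the URL, port, and model name are illustrative assumptions, not a fixed configuration):

```python
import json
from urllib.parse import urlparse

# All inference traffic stays on this machine: the endpoint is a loopback
# address, not a cloud vendor's API. URL, port, and model name are
# illustrative assumptions for a typical OpenWebUI setup.
API_URL = "http://127.0.0.1:3000/api/chat/completions"

payload = {
    "model": "llama3",  # served locally, e.g. by an Ollama backend
    "messages": [
        {"role": "user", "content": "Summarise this internal memo."},
    ],
}

host = urlparse(API_URL).hostname
print(f"Sending to {host}: {json.dumps(payload)[:60]}...")
assert host == "127.0.0.1"  # the request never leaves the box
```

The only network boundary to defend is your own: firewall the port, and nothing about the conversation is observable from outside.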
Meta's open-weight Llama family. Exceptional reasoning and coding capabilities.
Fast and efficient. Excellent for general-purpose chat and analysis.
Specialized for code generation, debugging, and technical writing.
Multimodal — understands images. Visual Q&A, OCR, and image analysis.
Text embeddings for RAG. Semantic search over your documents.
Your own fine-tuned models. Upload LoRA adapters for specialized tasks.
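The embeddings model above is what powers semantic search: document chunks and queries become vectors, and relevance reduces to vector similarity. A toy sketch with made-up 3-dimensional vectors (a real deployment would get much higher-dimensional vectors from the embedding model):

```python
import math

# Rank document chunks by cosine similarity to a query embedding.
# The vectors below are invented for illustration; in practice they
# come from an embedding model over your actual documents.
def cosine(a, b):
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)

chunks = {
    "GPU driver install notes": [0.9, 0.1, 0.2],
    "Quarterly budget summary": [0.1, 0.8, 0.3],
    "CUDA troubleshooting tips": [0.7, 0.3, 0.4],
}
query_vec = [0.85, 0.15, 0.25]  # illustrative embedding of "fix my GPU setup"

# Most similar chunk first.
ranked = sorted(chunks, key=lambda name: cosine(chunks[name], query_vec),
                reverse=True)
print(ranked[0])  # → GPU driver install notes
```

The GPU-related chunks score far above the budget summary even though no keywords are compared, which is the point of semantic (rather than lexical) search.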
Switch between models mid-conversation. Compare responses. Use the right model for each task.
Upload PDFs, docs, and code. Chat with your data. Full retrieval-augmented generation.
Extend with web search, code execution, image generation, and custom tools.
Every conversation is encrypted at rest. No telemetry. No external API calls unless you enable them.
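The document chat described above follows the standard retrieval-augmented generation pattern: retrieve the chunks most relevant to the question, then splice them into the prompt as cited context. A sketch of that final assembly step (the function name and prompt wording are our own illustration, not OpenWebUI's internals):

```python
# Assemble a RAG prompt from retrieved document chunks.
# Names and prompt template are illustrative assumptions.
def build_rag_prompt(question: str, retrieved_chunks: list[str]) -> str:
    # Number each chunk so the model can cite its sources.
    context = "\n\n".join(
        f"[{i + 1}] {chunk}" for i, chunk in enumerate(retrieved_chunks)
    )
    return (
        "Answer using only the context below. Cite sources by number.\n\n"
        f"Context:\n{context}\n\n"
        f"Question: {question}"
    )

chunks = [
    "The cluster has four RTX 4090 cards.",
    "Inference jobs are scheduled nightly at 02:00.",
]
prompt = build_rag_prompt("How many GPUs does the cluster have?", chunks)
print(prompt)
```

The model never sees the whole document, only the retrieved excerpts, which keeps prompts small and answers grounded in your own data.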
Open the full interface and start a conversation with your self-hosted models.