brianletort.ai

My Stack & Lab

The tools, frameworks, and hardware I use daily to build agentic AI systems. I love writing code—early mornings, weekends, whenever the ideas are flowing.

Home Lab Philosophy

I run enough GPUs in my home office to trip circuit breakers on a standard circuit. My NVIDIA RTX 5090 and Digits handle fine-tuning, inference testing, and rapid prototyping without cloud costs or latency. When you're iterating on multi-agent architectures and context engineering, fast local feedback loops are everything.

Hardware & Compute
  • NVIDIA RTX 5090 (32GB)

    Primary GPU for fine-tuning and local inference

  • NVIDIA Digits

    Compact AI supercomputer for development

  • M4 Pro MacBooks

    Daily development machines with Apple Silicon

  • Local NAS

    Terabytes of training data and knowledge bases

Development Environment
  • Cursor

    AI-native code editor—my daily driver

  • VS Code

    Notebooks and specific workflows

  • Warp / iTerm

    Modern terminal for heavy CLI work

  • GitHub

    Version control and collaboration

AI Coding Assistants
  • Claude Sonnet / Opus

    Primary AI assistant for complex reasoning

  • OpenAI Codex / GPT-4

    Code generation and analysis

  • GitHub Copilot

    Inline completions and suggestions

  • Local LLMs

    Ollama, vLLM for private experimentation

Cloud & Infrastructure
  • Azure

    Enterprise cloud and AI services

  • Vercel

    Frontend deployment and edge functions

  • Supabase

    Postgres + real-time for rapid prototyping

  • n8n

    Workflow automation and agent orchestration

AI/ML Platforms
  • OpenAI API

    GPT-4, embeddings, and function calling

  • Anthropic Claude

    Complex reasoning and long-context tasks

  • Azure OpenAI

    Enterprise deployments with compliance

  • Hugging Face

    Model hub and specialized models

RAG & Agent Frameworks
  • LlamaIndex

    Data framework for RAG applications

  • LangChain / LangGraph

    LLM orchestration and multi-agent flows

  • Semantic Kernel

    Microsoft's AI orchestration SDK

  • DSPy

    Programmatic prompt optimization

Vector Databases
  • Weaviate

    Open-source vector search with hybrid capabilities

  • pgvector

    Vector search in Postgres—simple and effective

  • Pinecone

    Managed vector database for production

  • Chroma

    Lightweight local vector store for prototyping

Python & Deep Learning
  • PyTorch

    Primary framework for model development

  • Transformers (HF)

    State-of-the-art NLP models

  • scikit-learn

    Classical ML and preprocessing

  • Weights & Biases

    Experiment tracking and observability

Frontend & Web
  • Next.js

    React framework—App Router for everything

  • TypeScript

    Type safety for complex applications

  • Tailwind CSS

    Utility-first styling

  • shadcn/ui

    Beautiful, accessible component library

  • Framer Motion

    Smooth animations and micro-interactions

Currently Experimenting With

12-15 Agent RAG Pipelines

Production

Specialized agents for retrieval optimization at terabyte scale

Context Engineering Techniques

Research

Memory optimization to overcome attention dilution

Plan-Act-Learn Pipelines

Prototyping

Self-learning ETL that adapts autonomously

Real-Time Hallucination Detection

Development

Quality feedback loops with continuous learning

Local Fine-Tuning Workflows

Development

LoRA and QLoRA on consumer GPUs for domain adaptation

Headless SaaS Patterns

Exploration

Agent-first API design for the post-GUI era

Research Focus Areas
  • Multi-agent parallel architectures for retrieval optimization
  • Recency bias and attention dilution mitigation
  • Reinforcement learning for RAG response quality
  • Forward-thinking entity-linking for feature spaces
  • Autonomous data pipeline adaptation
  • Agent-to-agent communication protocols

Open Source & Community

I'm a huge believer in open source. Most of my stack is built on open frameworks—LlamaIndex, LangChain, Semantic Kernel, Weaviate, pgvector. These communities move faster than any single company. I love building with Next.js, shadcn, and the modern JavaScript ecosystem. When the ideas are flowing, I'm writing code—whether it's tweaking RAG pipelines or fine-tuning models on my local GPUs.