<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"><channel><title>largo.dev</title><description>Production AI tutorials on embeddings, transformers, retrieval, and deployment. Complete, runnable code.</description><link>https://largo.dev/</link><language>en-us</language><item><title>GPU Sizing for ML Workloads</title><link>https://largo.dev/tutorials/production-ml/gpu-sizing-for-ml/</link><guid isPermaLink="true">https://largo.dev/tutorials/production-ml/gpu-sizing-for-ml/</guid><description>Learn to calculate VRAM requirements, select the right AWS instance, and optimize costs. Includes real benchmarks and a Python sizing calculator.</description><pubDate>Sat, 24 Jan 2026 00:00:00 GMT</pubDate></item><item><title>Experiment Tracking with MLflow and Langfuse</title><link>https://largo.dev/tutorials/production-ml/experiment-tracking/</link><guid isPermaLink="true">https://largo.dev/tutorials/production-ml/experiment-tracking/</guid><description>Set up experiment tracking for ML models with MLflow and LLM observability with Langfuse. Includes hyperparameter sweeps, model registry, and cost tracking.</description><pubDate>Sat, 24 Jan 2026 00:00:00 GMT</pubDate></item><item><title>CI/CD for Machine Learning</title><link>https://largo.dev/tutorials/production-ml/ml-cicd/</link><guid isPermaLink="true">https://largo.dev/tutorials/production-ml/ml-cicd/</guid><description>Build a complete ML pipeline with GitHub Actions: data validation, model training, automated testing, and staged deployment to production.</description><pubDate>Sat, 24 Jan 2026 00:00:00 GMT</pubDate></item><item><title>Model Serving on AWS</title><link>https://largo.dev/tutorials/production-ml/model-serving/</link><guid isPermaLink="true">https://largo.dev/tutorials/production-ml/model-serving/</guid><description>Deploy ML models to production with optimized inference: torch.compile vs ONNX benchmarks, FastAPI serving patterns, and AWS deployment options.</description><pubDate>Sat, 24 Jan 2026 00:00:00 GMT</pubDate></item><item><title>ML Monitoring and Drift Detection</title><link>https://largo.dev/tutorials/production-ml/ml-monitoring/</link><guid isPermaLink="true">https://largo.dev/tutorials/production-ml/ml-monitoring/</guid><description>Monitor production ML models with data drift detection, performance tracking, and automated alerting. Includes working Python implementations.</description><pubDate>Sat, 24 Jan 2026 00:00:00 GMT</pubDate></item><item><title>ML Security Best Practices</title><link>https://largo.dev/tutorials/production-ml/ml-security/</link><guid isPermaLink="true">https://largo.dev/tutorials/production-ml/ml-security/</guid><description>Secure your ML infrastructure with IAM roles, secrets management, VPC configuration, and input validation. Practical patterns for production systems.</description><pubDate>Sat, 24 Jan 2026 00:00:00 GMT</pubDate></item><item><title>What It Takes to Be a Senior Machine Learning Engineer</title><link>https://largo.dev/articles/senior-mle-guide/</link><guid isPermaLink="true">https://largo.dev/articles/senior-mle-guide/</guid><description>A roadmap to the skills, knowledge, and practices that separate senior MLEs from the rest - with links to hands-on tutorials for each area.</description><pubDate>Sat, 24 Jan 2026 00:00:00 GMT</pubDate></item><item><title>Building an AI Trading Agent with Claude and News Signals</title><link>https://largo.dev/tutorials/agents/trading-agent/</link><guid isPermaLink="true">https://largo.dev/tutorials/agents/trading-agent/</guid><description>Build an automated trading agent that extracts market signals from news using Claude Haiku, executes trades via Alpaca, and manages positions with trailing stops and sentiment monitoring.</description><pubDate>Sat, 17 Jan 2026 00:00:00 GMT</pubDate></item><item><title>Cross-Attention Fusion: Combining Text Embeddings with Structured Features</title><link>https://largo.dev/tutorials/embeddings/cross-attention-text-tabular-fusion/</link><guid isPermaLink="true">https://largo.dev/tutorials/embeddings/cross-attention-text-tabular-fusion/</guid><description>Concatenation is the default. Here&apos;s why cross-attention works better for combining text embeddings with tabular data—and how to implement it in PyTorch.</description><pubDate>Wed, 14 Jan 2026 00:00:00 GMT</pubDate></item><item><title>DeepSeek V3.2: Frontier Reasoning at 6x Lower Cost</title><link>https://largo.dev/tutorials/transformers/deepseek-v3-architecture/</link><guid isPermaLink="true">https://largo.dev/tutorials/transformers/deepseek-v3-architecture/</guid><description>Technical deep dive into DeepSeek V3.2&apos;s architecture: DeepSeek Sparse Attention (DSA), integrated reasoning with tool-use, and how it achieves IMO gold-medal performance.</description><pubDate>Sat, 03 Jan 2026 00:00:00 GMT</pubDate></item><item><title>2026 Frontier LLM Architectures: MLA, iRoPE, mHC, and the Race for Efficiency</title><link>https://largo.dev/articles/frontier-llm-architectures-2026/</link><guid isPermaLink="true">https://largo.dev/articles/frontier-llm-architectures-2026/</guid><description>Technical comparison of DeepSeek V3.2, Llama 4, Gemini 3, and Qwen3 architectures—plus DeepSeek&apos;s mHC innovation expected in V4.</description><pubDate>Sat, 03 Jan 2026 00:00:00 GMT</pubDate></item><item><title>Data Models for AI Applications: Pydantic vs Python Built-ins</title><link>https://largo.dev/tutorials/production-ml/data-models-for-ai-applications/</link><guid isPermaLink="true">https://largo.dev/tutorials/production-ml/data-models-for-ai-applications/</guid><description>Compare Python&apos;s data modeling options for AI/ML applications. Learn when to use dataclasses, TypedDict, or Pydantic for API responses, embeddings metadata, and agent tool contracts.</description><pubDate>Thu, 01 Jan 2026 00:00:00 GMT</pubDate></item><item><title>CFP Oracle: Semantic Search for College Football History</title><link>https://largo.dev/tutorials/retrieval-systems/cfp-oracle/</link><guid isPermaLink="true">https://largo.dev/tutorials/retrieval-systems/cfp-oracle/</guid><description>Build a semantic search system to find historically similar College Football Playoff games using Amazon S3 Vectors and Bedrock embeddings.</description><pubDate>Thu, 01 Jan 2026 00:00:00 GMT</pubDate></item><item><title>Getting Started with Amazon S3 Vectors</title><link>https://largo.dev/tutorials/retrieval-systems/s3-vectors-getting-started/</link><guid isPermaLink="true">https://largo.dev/tutorials/retrieval-systems/s3-vectors-getting-started/</guid><description>Build a semantic search system using AWS&apos;s new serverless vector storage. Store millions of embeddings in S3 with sub-second query times and serverless pricing.</description><pubDate>Wed, 31 Dec 2025 00:00:00 GMT</pubDate></item><item><title>2025: The Year AI Got a Reality Check</title><link>https://largo.dev/articles/2025-year-in-review/</link><guid isPermaLink="true">https://largo.dev/articles/2025-year-in-review/</guid><description>From DeepSeek&apos;s January bombshell to vibe coding going mainstream, here&apos;s what actually changed for AI practitioners in 2025.</description><pubDate>Wed, 31 Dec 2025 00:00:00 GMT</pubDate></item><item><title>Mamba for Predictive Maintenance: State Space Models vs Transformers</title><link>https://largo.dev/tutorials/production-ml/mamba-predictive-maintenance/</link><guid isPermaLink="true">https://largo.dev/tutorials/production-ml/mamba-predictive-maintenance/</guid><description>Compare Mamba&apos;s selective state space architecture against LSTM and Transformer for hard drive failure prediction. Learn when SSMs beat attention.</description><pubDate>Mon, 29 Dec 2025 00:00:00 GMT</pubDate></item><item><title>Build a Community Christmas Tree with AI-Generated Ornaments</title><link>https://largo.dev/tutorials/agents/christmas-tree-ornament-generator/</link><guid isPermaLink="true">https://largo.dev/tutorials/agents/christmas-tree-ornament-generator/</guid><description>Create a shared Christmas tree where visitors add AI-generated ornaments using Amazon Nova Canvas, with defense-in-depth content moderation using Bedrock Guardrails and Claude.</description><pubDate>Mon, 22 Dec 2025 00:00:00 GMT</pubDate></item><item><title>Build a Holiday Cocktail Agent with TheCocktailDB</title><link>https://largo.dev/tutorials/agents/holiday-cocktail-agent/</link><guid isPermaLink="true">https://largo.dev/tutorials/agents/holiday-cocktail-agent/</guid><description>Create an AI bartender that suggests cocktails based on weather, searches by ingredient, and generates party menus with shopping lists.</description><pubDate>Mon, 22 Dec 2025 00:00:00 GMT</pubDate></item><item><title>Building a Fishing Report Agent with AWS Strands</title><link>https://largo.dev/tutorials/agents/fishing-report-agent/</link><guid isPermaLink="true">https://largo.dev/tutorials/agents/fishing-report-agent/</guid><description>Create an AI agent that combines tide, weather, and marine data to generate fishing reports. Learn tool-calling patterns with the Strands SDK, NOAA APIs, and Claude on AWS Bedrock.</description><pubDate>Sun, 21 Dec 2025 00:00:00 GMT</pubDate></item><item><title>Bi-Encoders: Fast Semantic Search at Scale</title><link>https://largo.dev/tutorials/embeddings/bi-encoders-semantic-search/</link><guid isPermaLink="true">https://largo.dev/tutorials/embeddings/bi-encoders-semantic-search/</guid><description>Learn how bi-encoders enable sub-millisecond semantic search over millions of documents. Build a complete search system with sentence-transformers, FAISS indexing, and production-ready Python code.</description><pubDate>Sun, 21 Dec 2025 00:00:00 GMT</pubDate></item></channel></rss>