Tutorials · largo.dev

GPU Sizing for ML Workloads

Learn to calculate VRAM requirements, select the right AWS instance, and optimize costs. Includes real benchmarks and a Python sizing calculator.

#gpu #infrastructure #aws

Open Seas 30 min read

Experiment Tracking with MLflow and Langfuse

Set up experiment tracking for ML models with MLflow and LLM observability with Langfuse. Includes hyperparameter sweeps, model registry, and cost tracking.

#mlflow #langfuse #mlops

Deep Dive 35 min read

CI/CD for Machine Learning

Build a complete ML pipeline with GitHub Actions: data validation, model training, automated testing, and staged deployment to production.

#cicd #github-actions #mlops

Deep Dive 40 min read

Model Serving on AWS

Deploy ML models to production with optimized inference: torch.compile vs ONNX benchmarks, FastAPI serving patterns, and AWS deployment options.

#inference #deployment #optimization

Deep Dive 35 min read

ML Monitoring and Drift Detection

Monitor production ML models with data drift detection, performance tracking, and automated alerting. Includes working Python implementations.

#monitoring #drift #mlops

Deep Dive 30 min read

ML Security Best Practices

Secure your ML infrastructure with IAM roles, secrets management, VPC configuration, and input validation. Practical patterns for production systems.

#security #iam #vpc

Deep Dive 45 min read

Building an AI Trading Agent with Claude and News Signals

Build an automated trading agent that extracts market signals from news using Claude Haiku, executes trades via Alpaca, and manages positions with trailing stops and sentiment monitoring.

#claude #trading #alpaca

Deep Dive 35 min read

Cross-Attention Fusion: Combining Text Embeddings with Structured Features

Concatenation is the default. Here's why cross-attention works better for combining text embeddings with tabular data—and how to implement it in PyTorch.

#multimodal #cross-attention #tabular

Deep Dive 25 min read

DeepSeek V3.2: Frontier Reasoning at 6x Lower Cost

Technical deep dive into DeepSeek V3.2's architecture: DeepSeek Sparse Attention (DSA), integrated reasoning with tool-use, and how it achieves IMO gold-medal performance.

#deepseek #architecture #sparse-attention

Open Seas 20 min read

Data Models for AI Applications: Pydantic vs Python Built-ins

Compare Python's data modeling options for AI/ML applications. Learn when to use dataclasses, TypedDict, or Pydantic for API responses, embeddings metadata, and agent tool contracts.

#pydantic #python #api

Open Seas 15 min read

CFP Oracle: Semantic Search for College Football History

Build a semantic search system to find historically similar College Football Playoff games using Amazon S3 Vectors and Bedrock embeddings.

#s3-vectors #embeddings #semantic-search

Open Seas 25 min read

Getting Started with Amazon S3 Vectors

Build a semantic search system using AWS's new serverless vector storage. Store millions of embeddings in S3 with sub-second query times and serverless pricing.

#s3 #vectors #embeddings

Deep Dive 25 min read

Mamba for Predictive Maintenance: State Space Models vs Transformers

Compare Mamba's selective state space architecture against LSTM and Transformer for hard drive failure prediction. Learn when SSMs beat attention.

#mamba #state-space-models #predictive-maintenance

Open Seas 35 min read

Build a Community Christmas Tree with AI-Generated Ornaments

Create a shared Christmas tree where visitors add AI-generated ornaments using Amazon Nova Canvas, with defense-in-depth content moderation using Bedrock Guardrails and Claude.

#nova-canvas #image-generation #aws-lambda

Open Seas 25 min read

Build a Holiday Cocktail Agent with TheCocktailDB

Create an AI bartender that suggests cocktails based on weather, searches by ingredient, and generates party menus with shopping lists.

#agents #strands-sdk #cocktails

Open Seas 40 min read

Building a Fishing Report Agent with AWS Strands

Create an AI agent that combines tide, weather, and marine data to generate fishing reports. Learn tool-calling patterns with the Strands SDK, NOAA APIs, and Claude on AWS Bedrock.

#strands #ai-agents #aws

Open Seas 30 min read

Bi-Encoders: Fast Semantic Search at Scale

Learn how bi-encoders enable sub-millisecond semantic search over millions of documents. Build a complete search system with sentence-transformers, FAISS indexing, and production-ready Python code.

#sentence-transformers #semantic-search #faiss

Open Seas 25 min read

Cross-Encoders: Precision Reranking for Search

When bi-encoders aren't accurate enough, cross-encoders dramatically improve search relevance. Build a two-stage retrieval system with MS MARCO rerankers and sentence-transformers.

#cross-encoder #reranking #semantic-search

Deep Dive 45 min read

Visual Semantic Search with CLIP and AWS

Build a production image search system using OpenAI's CLIP model, Amazon OpenSearch Serverless for vector storage, and Claude on Bedrock for image descriptions. Complete Python implementation with real AWS outputs.

#clip #opensearch-serverless #bedrock

Calm Waters 25 min read

Sentence Embeddings from Scratch with PyTorch

Build a complete sentence encoder from the ground up. Learn tokenization, embedding layers, pooling strategies, and benchmark on semantic similarity.

#pytorch #nlp #embeddings

Open Seas 45 min read

Predicting Hard Drive Failures with XGBoost, LSTM, and Transformers

Build a production-ready failure prediction system using real Backblaze data. Compare traditional ML vs deep learning approaches and learn when each shines.

#xgboost #lstm #time-series

Deep Dive 25 min read

DeepSeek R1 Architecture: How Reinforcement Learning Creates Reasoning

Deep dive into DeepSeek R1's architecture: how pure RL training enables chain-of-thought reasoning, the GRPO algorithm, MoE design, and knowledge distillation.

#deepseek #architecture #reinforcement-learning

Open Seas 20 min read

Getting Started with DeepSeek R1: The Open-Source Reasoning Model

Run DeepSeek R1 on AWS Bedrock or locally with Ollama. Learn to use its chain-of-thought reasoning for complex problem-solving, coding, and math.

#deepseek #reasoning #bedrock