Open Seas 25 min read GPU Sizing for ML Workloads Learn to calculate VRAM requirements, select the right AWS instance, and optimize costs. Includes real benchmarks and a Python sizing calculator. #gpu #infrastructure #aws
Open Seas 30 min read Experiment Tracking with MLflow and Langfuse Set up experiment tracking for ML models with MLflow and LLM observability with Langfuse. Includes hyperparameter sweeps, model registry, and cost tracking. #mlflow #langfuse #mlops
Deep Dive 35 min read CI/CD for Machine Learning Build a complete ML pipeline with GitHub Actions: data validation, model training, automated testing, and staged deployment to production. #cicd #github-actions #mlops
Deep Dive 40 min read Model Serving on AWS Deploy ML models to production with optimized inference: torch.compile vs ONNX benchmarks, FastAPI serving patterns, and AWS deployment options. #inference #deployment #optimization
Deep Dive 35 min read ML Monitoring and Drift Detection Monitor production ML models with data drift detection, performance tracking, and automated alerting. Includes working Python implementations. #monitoring #drift #mlops
Deep Dive 30 min read ML Security Best Practices Secure your ML infrastructure with IAM roles, secrets management, VPC configuration, and input validation. Practical patterns for production systems. #security #iam #vpc
Deep Dive 45 min read Building an AI Trading Agent with Claude and News Signals Build an automated trading agent that extracts market signals from news using Claude Haiku, executes trades via Alpaca, and manages positions with trailing stops and sentiment monitoring. #claude #trading #alpaca
Deep Dive 35 min read Cross-Attention Fusion: Combining Text Embeddings with Structured Features Concatenation is the default. Here's why cross-attention works better for combining text embeddings with tabular data—and how to implement it in PyTorch. #multimodal #cross-attention #tabular
Deep Dive 25 min read DeepSeek V3.2: Frontier Reasoning at 6x Lower Cost Technical deep dive into DeepSeek V3.2's architecture: DeepSeek Sparse Attention (DSA), integrated reasoning with tool-use, and how it achieves IMO gold-medal performance. #deepseek #architecture #sparse-attention
Open Seas 20 min read Data Models for AI Applications: Pydantic vs Python Built-ins Compare Python's data modeling options for AI/ML applications. Learn when to use dataclasses, TypedDict, or Pydantic for API responses, embeddings metadata, and agent tool contracts. #pydantic #python #api
Open Seas 15 min read CFP Oracle: Semantic Search for College Football History Build a semantic search system to find historically similar College Football Playoff games using Amazon S3 Vectors and Bedrock embeddings. #s3-vectors #embeddings #semantic-search
Open Seas 25 min read Getting Started with Amazon S3 Vectors Build a semantic search system using AWS's new serverless vector storage. Store millions of embeddings in S3 with sub-second query times and serverless pricing. #s3 #vectors #embeddings
Deep Dive 25 min read Mamba for Predictive Maintenance: State Space Models vs Transformers Compare Mamba's selective state space architecture against LSTM and Transformer for hard drive failure prediction. Learn when SSMs beat attention. #mamba #state-space-models #predictive-maintenance
Open Seas 35 min read Build a Community Christmas Tree with AI-Generated Ornaments Create a shared Christmas tree where visitors add AI-generated ornaments using Amazon Nova Canvas, with defense-in-depth content moderation using Bedrock Guardrails and Claude. #nova-canvas #image-generation #aws-lambda
Open Seas 25 min read Build a Holiday Cocktail Agent with TheCocktailDB Create an AI bartender that suggests cocktails based on weather, searches by ingredient, and generates party menus with shopping lists. #agents #strands-sdk #cocktails
Open Seas 40 min read Building a Fishing Report Agent with AWS Strands Create an AI agent that combines tide, weather, and marine data to generate fishing reports. Learn tool-calling patterns with the Strands SDK, NOAA APIs, and Claude on AWS Bedrock. #strands #ai-agents #aws
Open Seas 30 min read Bi-Encoders: Fast Semantic Search at Scale Learn how bi-encoders enable sub-millisecond semantic search over millions of documents. Build a complete search system with sentence-transformers, FAISS indexing, and production-ready Python code. #sentence-transformers #semantic-search #faiss
Open Seas 25 min read Cross-Encoders: Precision Reranking for Search When bi-encoders aren't accurate enough, cross-encoders dramatically improve search relevance. Build a two-stage retrieval system with MS MARCO rerankers and sentence-transformers. #cross-encoder #reranking #semantic-search
Deep Dive 45 min read Visual Semantic Search with CLIP and AWS Build a production image search system using OpenAI's CLIP model, Amazon OpenSearch Serverless for vector storage, and Claude on Bedrock for image descriptions. Complete Python implementation with real AWS outputs. #clip #opensearch-serverless #bedrock
Calm Waters 25 min read Sentence Embeddings from Scratch with PyTorch Build a complete sentence encoder from the ground up. Learn tokenization, embedding layers, pooling strategies, and benchmark on semantic similarity. #pytorch #nlp #embeddings
Open Seas 45 min read Predicting Hard Drive Failures with XGBoost, LSTM, and Transformers Build a production-ready failure prediction system using real Backblaze data. Compare traditional ML vs deep learning approaches and learn when each shines. #xgboost #lstm #time-series
Deep Dive 25 min read DeepSeek R1 Architecture: How Reinforcement Learning Creates Reasoning Deep dive into DeepSeek R1's architecture: how pure RL training enables chain-of-thought reasoning, the GRPO algorithm, MoE design, and knowledge distillation. #deepseek #architecture #reinforcement-learning
Open Seas 20 min read Getting Started with DeepSeek R1: The Open-Source Reasoning Model Run DeepSeek R1 on AWS Bedrock or locally with Ollama. Learn to use its chain-of-thought reasoning for complex problem-solving, coding, and math. #deepseek #reasoning #bedrock