|
i'm an AI/ML engineer based in the US. right now i'm building production AI systems at Reallytics.ai and Verticiti, mostly getting large language models to do useful things in the real world. not demos, actual systems with real users and real traffic. before this i was at Afiniti and Cloud Kinetics for a few years. fraud detection, voice analytics, enterprise search. the kind of stuff that pages you at 3am when something breaks. honestly what keeps me going is when an agent you built solves something you never explicitly told it to do. that feeling never gets old. what i'm working on right now:
|
|
|
Agentic AI Workflows |
RAG Enterprise Search |
|
Voice AI Platform |
LLM Fine-Tuning LoRA |
|
RLHF LLM Optimization |
Sentinel Fraud Detection |
not going to pretend i use everything equally. here's what i actually reach for:
the full picture (click to expand)
| daily drivers | Python, PyTorch, FastAPI, Docker, Git, VS Code |
| LLM and GenAI | LangChain, LlamaIndex, HuggingFace Transformers, vLLM, PEFT/LoRA/QLoRA |
| data and vector | FAISS, ChromaDB, Pinecone, PostgreSQL, MongoDB, Redis, Kafka, Elasticsearch |
| cloud and MLOps | AWS (SageMaker, Bedrock, Lambda, ECS), GCP Vertex AI, Azure OpenAI |
| ML frameworks | TensorFlow, scikit-learn, XGBoost, LightGBM, ONNX |
| infrastructure | Kubernetes, Terraform, GitHub Actions, MLflow, Weights & Biases |
i write about what i'm building and learning. nothing polished, more like notes to my future self that happen to be public.
|
Automl For Complex High Dimensional Data
|
|
Real Time Time Series Forecasting With Streaming D
|
💬 Commented on 🧠 [Research] Fundamental Equation of Consciousness: Ψ = argm in Stability-AI/generative-models (2026-06-26)
💬 Commented on [BUG] <Technical Issue Report: Qwen Model Recommends Physica in QwenLM/Qwen (2026-06-26)
💬 Commented on 本地部署深求v4 pro的1.6万亿参数. 祝大家创业成功! in deepseek-ai/DeepSeek-V3 (2026-06-26)
💬 Commented on [BUG] @mlflow/opencode traces are logged but session metadat in mlflow/mlflow (2026-06-26)
💬 Commented on feature: OpenAI tools param support for non-passthrough guar in NVIDIA-NeMo/Guardrails (2026-06-26)
💬 Commented on Face Swap Workflow Not Working (Yolov10m not working) in Comfy-Org/ComfyUI (2026-06-26)
💬 Commented on [data] Autoscaler uses raw ExecutionResources , causing asse in ray-project/ray (2026-06-25)
💬 Commented on Qwen3‑VL on‑policy KD silently trains "blind" with sglang 0. in songmzhang/KDFlow (2026-06-18)
stuff i've been digging into recently. mostly papers, blog posts, and rabbit holes that kept me up too late.
🔬 AutoML for Complex, High-Dimensional Data
🔬 Real-Time Data Quality Monitoring for ML Systems
🔬 Fine-Tuning LLMs with Parameter-Efficient Methods (LoRA/QLoRA) at Scale
🔬 Retrieval-Augmented Generation (RAG) with Streaming Data
🔬 Real-Time Time Series Forecasting with Streaming Data
🔬 Optimizing On-Device AI with Quantization and Distillation for Edge Deployment
📌 Embedding Cache with LRU Eviction — Production Pattern (Python) (2026-06-25)
📌 Agent Tool Registry with Dynamic Discovery — Production Pattern (Python) (2026-06-23)
📌 Vector Similarity Search with FAISS — Production Pattern (Python) (2026-06-22)
🤖 Profile auto-updated on 2026-06-26 19:57 UTC


