Full-stack technologist who ships products. 25 years building software that generates revenue, reduces costs, and runs for years. Now AI-enhanced β I build with LLMs, RAG systems, and agentic tools to ship faster and smarter.
I'm an expert generalist: I go deep where needed, see the whole picture, and move fluidly across domains.
π€ AI Applications
- RAG architectures, vector search, multi-LLM integration (Anthropic, OpenAI, Ollama)
- Evaluation frameworks, prompt engineering, streaming interfaces
- Agentic development patterns and context engineering
β‘ Full-Stack Products
- React, Next.js, TypeScript, Python, Node.js, C#, Java, CFML
- PostgreSQL, SQL Server, cloud databases
- API design, system architecture, legacy modernization
βοΈ Platform & Infrastructure
- AWS (Certified AI Practitioner), Cloudflare Workers, Vercel
- DevSecOps, observability (DataDog), cost optimization
- Zero-downtime migrations, disaster recovery
| Project | Description | Stack |
|---|---|---|
| Vercel RAG Demo | Streaming chatbot with semantic search | Next.js 16, pgvector, OpenAI |
| Cloudflare RAG Demo | Edge-deployed, on-device LLM inference | Workers, Llama-3.1-8b, Vectorize |
| stevenleve.com | Professional site, 96% security score | Astro, Cloudflare Workers |
- chatgpt-data-extractor β Production RAG app with multi-LLM support and vector search
- intel-gpu-llm-inference β Open-source LLM benchmarking on Intel Arc GPUs
- rag_wiki_demo β Jupyter evaluation lab with custom metrics framework
17 years at ShareASale/Awin Global β built products that processed $1B+ in annual transactions:
- Recruitment CRM generating $10M/year in commissions
- Email systems delivering 2M+ messages/month
- Tracking infrastructure handling 250M monthly requests
- Zero-downtime AWS migration saving $20K/month
Currently: Fractional CTO for a MarTech startup | Open to interesting problems
π LinkedIn β’ π stevenleve.com