Back
Curriculum Vitae
Overview

Built for signal, not noise.

Five years of software engineering experience across consumer-scale systems, AI tooling, production delivery, and platform-minded backend work. I care about architectures that are measurable, grounded, and operationally realistic.

5
Years building production systems
1M+
Users supported at scale
54%
LLM cost reduction
92%
RAG factual precision
Experience

Professional Experience

Software Development Engineer

Large-Scale Consumer Platform · Greater Seattle Area

2021 — Present

Consumer Experience Platform

Led engineering on a 1M+ user product serving roughly 1,000 TPS during peak.

  • Built a Python forecasting pipeline that extracted endpoints, pulled traffic history from internal observability systems, and modeled TPS growth, cutting a four-month capacity planning process down to two months.
  • Drove a platform migration with percentage rollouts and URL-based state handoff, eliminating full page reloads and improving average page load latency by 40%.
  • Re-architected the experience from a legacy server-rendered UI stack to React + Spring Boot and introduced a config-driven rule engine so new device flows could launch through JSON configuration instead of core code changes.
  • Built an internal AI knowledge assistant using RAG and a custom MCP server, helping unblock onboarding and mentoring two interns through faster ramp-up.

Identity and Platform Services

Worked on identity-aware backend systems and AI-assisted developer workflows.

  • Built an identity-aware microservice on AWS Fargate for automated membership validation with secure cross-account isolation.
  • Led a cross-region migration for a Redis-backed device-awareness workload in EU infrastructure, writing the migration plan, provisioning the destination Lambda + Redis stack ahead of time, and coordinating phased traffic movement.
  • Used VPC peering, async dual-writes, and a one-day TTL overlap window to keep Redis state aligned across both regions, avoid meaningful latency regressions, and eliminate cold-start or cache-consistency issues during cutover.
  • Integrated an AI code review assistant into CI/CD with AWS Lambda and a managed cloud LLM platform, reducing manual review cycles by 25% through tiered prompt design.
  • Deployed an async job orchestration system for large-scale metadata sync with proactive CloudWatch alerting for authentication anomalies.
Projects

Selected Work

AI Reliability Copilot
2026
  • Built an incident-response copilot that turns raw Datadog, PagerDuty, Sentry, and on-call context into a structured nine-section reliability report.
  • Designed bilingual prompt flows, sample incident scenarios, eval surfaces, and knowledge-base affordances for repeatable SRE-style analysis.
  • Shipped a production Vercel demo with request limiting, JSON alert parsing, and a workflow optimized for fast triage during high-pressure incidents.
Live demo
Book Traveler / Shuzhongren
2026
  • Built an AI-driven interactive wuxia fiction app where readers create a character, enter a novel world, and steer irreversible story branches.
  • Designed a three-beat narrative engine that generates outlines, chapter direction, and major fate nodes while preserving a classical Chinese prose style.
  • Implemented archive-oriented product flows for starting, continuing, and eventually sharing personalized story runs across multiple source worlds.
Live demo
Admitly
2025
  • Orchestrated 16 task-tagged AI operations across four agent pipelines, including a 12-step deep research flow and a five-stage essay coaching system.
  • Built a hybrid RAG pipeline with semantic chunking, pgvector search, BM25, and reciprocal rank fusion, reaching 92% factual precision.
  • Implemented model routing across Opus, Sonnet, and Haiku to reduce per-pipeline LLM cost by 54%.
AI Financial Intelligence Agent
2024
  • Engineered a multi-tool reasoning agent using Claude function calling to chain five tools across news, filings, pricing, technicals, and sentiment.
  • Delivered structured investment signals with cited sources in under three seconds while processing 500+ articles per day.
  • Built an embedding-based clustering pipeline and few-shot sentiment classifier that reduced false positives by 45% and reached 79% precision on high-impact events.
Education

Academic Background

University of Pennsylvania
M.S. in Computer and Information Technology
Philadelphia, PA · Aug 2018 — Dec 2020
Sun Yat-Sen University
B.S. in Material Physics
Guangzhou, China · Aug 2013 — May 2017
Capabilities

Technical Coverage

AI Systems

RAG pipelinesMulti-agent orchestrationClaude APIManaged LLM platformsPrompt engineeringLLM evalsMCP serversEmbedding retrieval

Application Engineering

ReactNext.js App RouterReact NativeTypeScriptJavaSpring BootNode.jsFastAPI

Data & Infra

PostgreSQLpgvectorMongoDBDynamoDBRedisDockerKubernetesAWS LambdaECS/FargateCloudWatchGitHub Actions
Next Step

Want the conversational version?

You can also ask the AI chat about my experience, projects, or technical decisions. It answers from my actual resume and project content.