Wade Deng

Experience

MaybeAI — CTO / Co-Founder

2025/01 – Present · AI Agent Platform

AI agent platform for automated online workflows — planning, reasoning, MCP tool usage
Harness engineering for AI-native execution systems
Clients with 100M+ GMV, improved AI-driven operating efficiency

XCelsiorAI — CTO / Co-Founder

2024/06 – 2025/01 · AI Productivity

AI-native spreadsheet product — LLMs prompted directly inside Google Sheets & Excel
Won 1,000 pilot customers in early product phase

Footprint Analytics — CTO / Co-Founder

2021/10 – 2024/06 · Web3 Data Platform

Web3 big data platform indexing 30+ chains for 12k+ analysts and 50+ B2B customers
Led 25-person engineering team across platform, data & product
PB-scale ETL: Kafka, Flink, Spark, Doris, StarRocks · sub-second API latency
Open-source growth: 0 → 12k users

Infigo Technology — IT Manager

2019/01 – 2022/04 · Global Digital Lending

Credit scoring AI for Latin America & SE Asia operations
Model AUC: 0.63 → 0.79 · KS: 0.21 → 0.34
AI lending ops in Mexico, India, China, Colombia

Core Skills

AI / LLM Credit Scoring Anti-Fraud Computer Vision Python TensorFlow / PyTorch FastAPI Node.js Kafka / Flink / Spark Doris / StarRocks MCP Harness Engineering Team Leadership (25p)

Tech Talks & Presentations

🤖 AI / LLM

Anthropic MCP — Model Context Protocol

Open in Google Slides ↗

Navigating the Era of Large Models

Open in Google Slides ↗

📊 Modern Big Data

⛓️ On-chain Data Analysis

Web3 Data Analytics Platform — Project Examples

Open in Google Slides ↗

Wallet Profile / Modeling with On-chain Data

Open in Google Slides ↗

Tracking Crypto Cashflow — Stable Coin Example

Open in Google Slides ↗

☁️ Modern Cloud & DevOps (2016–2020)

System Security Status — Security Best Practices

Open in Google Slides ↗

HAProxy Configuration — Load Balancing & Logging

Open in Google Slides ↗

🛠️ Modern Languages & Frameworks (2014–2016)

Golang — Language Overview

Open in Google Slides ↗

Design Patterns in JavaScript — Architecture Patterns

Open in Google Slides ↗

JSConf 2014 Hangzhou — Conference Insights

Open in Google Slides ↗

⚙️ Backend Infrastructure (2013–2015)

Jenkins CI & Unit Testing — Continuous Integration

Open in Google Slides ↗

Backend Development Training — Best Practices

Open in Google Slides ↗

Grunt for Backend — Task Automation

Open in Google Slides ↗

📱 Mobile Development Era (2012–2014)

PhoneGap Basics — Mobile Development

Open in Google Slides ↗

🔄 Modern Development Practices

Git Workflow — Version Control Best Practices

Open in Google Slides ↗

Agile Development — Methodology Overview

Open in Google Slides ↗

Context Engineering

The new skill in AI isn't prompting — it's Context Engineering. As AI agents become mainstream, the difference between a cheap demo and a "magical" agent is the quality of context you provide.

What is Context Engineering?

Context Engineering is the discipline of designing and building dynamic systems that provide the right information and tools, in the right format, at the right time, so an LLM can accomplish a task.

Think of it as everything the model "sees" before it generates a response — not just the prompt string, but a complete system output.

What Makes Up "Context"?

Instructions / System Prompt — initial behavior definition, rules, examples
User Prompt — immediate task or question
Short-term Memory — current conversation history
Long-term Memory — user preferences, project summaries, persistent facts
RAG (Retrieved Info) — external knowledge from docs, DBs, APIs
Available Tools — function definitions the agent can call
Structured Output — schema definitions for the response format

Prompt Eng vs Context Eng

Prompt Engineering focuses on crafting the perfect instruction string. Context Engineering is a system that runs before the LLM call — it's dynamic, task-specific, and covers both knowledge and capabilities.

Most agent failures today are not model failures — they're context failures. The agent doesn't have what it needs.

Why It Matters for SE / Data People

Imagine you ask an AI: "Check if we have inventory for SKU-123 and send an order confirmation if we do."

A "cheap demo" agent sees only the user message and outputs a robotic reply. A "magical" agent enriches context before calling the LLM: your calendar (shows you're busy), past emails (informal tone), contact list (key partner), and tools like check_inventory and send_email. The output becomes genuinely useful.

For data engineers: context engineering means your data pipeline, schema definitions, and domain knowledge all need to be legible to the AI at the right moment — not dumped as a massive doc, but surfaced dynamically.

Harness Engineering

OpenAI's term for the discipline of designing environments, feedback loops, and control systems that help AI agents accomplish complex, reliable software work at scale.

The Core Idea: Humans Steer, Agents Execute

OpenAI's team built a product with 0 lines of manually-written code. Over 5 months, Codex wrote ~1 million lines across application logic, tests, CI, docs, and tooling — at roughly 1/10th the time it would have taken by hand.

The key shift: engineering work moved from writing code to designing environments, specifying intent, and building feedback loops. Humans steer. Agents execute.

What Broke (and Why)

Early progress was slower than expected — not because Codex was incapable, but because the environment was underspecified. The agent lacked tools, abstractions, and internal structure to make progress.

The fix was almost never "try harder." Instead: "What capability is missing, and how do we make it both legible and enforceable for the agent?"

The Map, Not the Encyclopedia

One big AGENTS.md fails predictably: context is crowded out, everything becomes "important," the file rots, and drift is inevitable.

Instead: a short AGENTS.md (~100 lines) acts as a table of contents. The real knowledge lives in a structured docs/ directory treated as the system of record. Progressive disclosure — teach the agent where to look, don't overwhelm it up front.

Key Patterns from Harness Engineering

Agent legibility is the goal — anything the agent can't access in-context while running effectively doesn't exist. Knowledge in Google Docs or chat threads = invisible to the system.
Enforce architecture with invariants, not micromanagement — strict boundaries and predictable structure let agents ship fast without undermining the foundation.
Feedback loops as first-class infrastructure — testing, validation, review, and recovery are encoded directly into the system, not done manually.
"Garbage collection" for AI-generated code — technical debt is high-interest debt. Pay it down continuously in small increments rather than letting patterns compound.
Custom lints with remediation instructions — encode error messages to inject fix instructions directly into agent context.

Claude Code & Codex in Practice

Both Claude Code and OpenAI Codex are harness engineering tools — they take a repository context and drive a development loop: design → code → review → test → deploy. The quality of your AGENTS.md, docs structure, and feedback tooling determines how effective they are.

For SE/data teams: integrating these tools means investing in documentation hygiene, structured knowledge bases, and fast feedback loops (CI that gives agents signal they can act on).

What This Means for Your Team

Throughput changes merge philosophy: when agent output far exceeds human attention, corrections are cheap and waiting is expensive. Minimal blocking merge gates. Short-lived PRs. Test flakes addressed with follow-up runs.

Human judgment is still required — but it operates at a higher layer of abstraction: prioritize work, translate feedback into acceptance criteria, validate outcomes.

Experience

MaybeAI — CTO / Co-Founder

XCelsiorAI — CTO / Co-Founder

Footprint Analytics — CTO / Co-Founder

Infigo Technology — IT Manager

Core Skills

Tech Talks & Presentations

🤖 AI / LLM

Anthropic MCP — Model Context Protocol

Navigating the Era of Large Models

📊 Modern Big Data

Big Data 101/102 — Intro

Big Data 201/202 — Parallel Computing

Big Data 301 — Hands-on Spark

Big Data 303 — SQL Technique

Big Data 304 — Spark in Deep (S01)

Big Data 305 — DBT

Big Data 401 — Spark in Deep (S02)

Big Data 402 — Streaming

Blockchain Analytics with Apache Doris — Singapore Meetup

⛓️ On-chain Data Analysis

Web3 Data Analytics Platform — Project Examples

Wallet Profile / Modeling with On-chain Data

Tracking Crypto Cashflow — Stable Coin Example

☁️ Modern Cloud & DevOps (2016–2020)

System Security Status — Security Best Practices

HAProxy Configuration — Load Balancing & Logging

🛠️ Modern Languages & Frameworks (2014–2016)

Golang — Language Overview

Design Patterns in JavaScript — Architecture Patterns

JSConf 2014 Hangzhou — Conference Insights

⚙️ Backend Infrastructure (2013–2015)

Jenkins CI & Unit Testing — Continuous Integration

Backend Development Training — Best Practices

Grunt for Backend — Task Automation

📱 Mobile Development Era (2012–2014)

PhoneGap Basics — Mobile Development

🔄 Modern Development Practices

Git Workflow — Version Control Best Practices

Agile Development — Methodology Overview

Context Engineering

What is Context Engineering?

What Makes Up "Context"?

Prompt Eng vs Context Eng

Why It Matters for SE / Data People

Harness Engineering

The Core Idea: Humans Steer, Agents Execute

What Broke (and Why)

The Map, Not the Encyclopedia

Key Patterns from Harness Engineering

Claude Code & Codex in Practice

What This Means for Your Team