Navigating AI's Frontier in 2025: Grace Isford, Lux Capital

1. Lux Capital and the AI Frontier

Grace Isford, a partner at Lux Capital, begins her presentation at the AI Engineer Summit about navigating AI's frontier in 2025. She introduces Lux Capital's philosophy of "believing before others understand" and investing in Frontier Tech ideas that "turn Sci-Fi into Sci-Fact."

Lux Capital has collaborated with AI companies at early stages including Hugging Face (the GitHub for ML), Together AI (open-source AI cloud), Physical Intelligence (robot software brain), and S AI (evolutionary, nature-inspired algorithm lab).
She highlights New York City as an AI hub, with many of Lux Capital's AI portfolio companies headquartered or maintaining a major presence there.

"Lux started in New York, and we're very bullish on AI opportunities in New York."

2. AI's Explosive Growth and 2025 Status

Grace looks back at AI's explosive growth since Stable Diffusion in August 2022. She evaluates the past 18 months' advances as "more aggressive, more impressive, and more widespread."

Key AI events of 2025:
- $500 billion Stargate Project: Collaboration between the US government, OpenAI, SoftBank, and Oracle.
- OpenAI's GPT-4.03: Surpassing human performance and achieving an AGI challenge.
- DeepSeek's R1 Model: Causing such impact that it affected Nvidia's stock price.
- France AI Summit: President Macron announcing European AI initiatives, re-engaging Europe in the AI race.

"2025 is the moment for AI agents. It's the perfect storm, but the lightning hasn't struck yet."

3. Current Limitations of AI Agents

Grace explains why AI agents aren't fully working yet, comparing it to "lightning that hasn't struck." She defines AI agents as "fully autonomous systems where LLMs decide actions on their own."

Real case: Flight booking failure
- Grace asked OpenAI Operator to book a flight from New York to San Francisco but it failed multiple times.
- First attempt: Couldn't find flights on Kayak.
- Second attempt: Recommended rush-hour flights on Skyscanner.
- It failed to properly reflect personal preferences (seat position, airline, price, etc.).

"AI fails because small cumulative errors compound. This goes beyond simple 'hallucination' problems."

4. Major Error Types in AI Agents

Grace classifies the main errors AI agents face into four types.

Decision Error: Choosing wrong facts.
- Example: Booking San Francisco, Peru instead of San Francisco, California.
Implementation Error: Wrong approach or integration issues.
- Example: Data flow disruption due to CAPTCHA problems.
Heuristic Error: Applying wrong criteria.
- Example: Not considering JFK airport traffic conditions.
Taste Error: Misunderstanding personal preferences.
- Example: Booking a Boeing 737 Max aircraft that Grace dislikes.

"AI does magical things, but small mistakes compound and fail to meet human expectations."

5. The Impact of Compounding Errors

Grace visually explains how compounding errors affect AI agent performance.

Comparing 99% vs. 95% accurate agents:
- After 50 steps, a 99% accurate agent drops to 60% accuracy.
- Compounding errors increase complexity even in simple tasks.

"Even a simple flight booking can become extremely complex due to compounding errors."

6. Five Strategies for Building AI Agents

Grace proposes five strategies to reduce compounding errors and improve AI agents.

Data Curation:
- Manage data quality to ensure AI agents have the information they need.
- "Data is the best asset, and curation makes it more effective."
- Example: Collecting Grace's travel preference data and reflecting it in real-time.
Evaluation Systems (Evals):
- Measure model responses and select correct answers.
- "Setting evaluation criteria for non-verifiable systems (e.g., personal preferences) is crucial."
Scaffolding Systems:
- Prevent a single error from affecting the entire system.
- Example: Ramp's infrastructure logic designed to prevent error propagation.
User Experience (UX):
- Improve UX to make AI agents better collaborators.
- "UX is the differentiating factor for AI apps. It should promote beautiful, elegant human-machine collaboration."
Multimodal Design:
- Integrate new modalities beyond text — voice, images, smell, touch.
- "We need to think beyond chatbot interfaces about making AI more human."

7. Conclusion: The Future of AI Agents

Grace emphasizes that while AI agents aren't yet perfect, they can be improved through data curation, evaluation, scaffolding, UX, and multimodal design.

"We're in the perfect storm for AI agents, but the lightning hasn't struck yet. I look forward to the innovative AI products you'll create."

This presentation focuses on identifying current limitations of AI agents and offering practical strategies for overcoming them while sharing a vision for AI's future. Grace's insightful proposals will provide great inspiration to AI developers and leaders.

1. Lux Capital and the AI Frontier

Lux Capital has collaborated with AI companies at early stages including Hugging Face (the GitHub for ML), Together AI (open-source AI cloud), Physical Intelligence (robot software brain), and S AI (evolutionary, nature-inspired algorithm lab).
She highlights New York City as an AI hub, with many of Lux Capital's AI portfolio companies headquartered or maintaining a major presence there.

"Lux started in New York, and we're very bullish on AI opportunities in New York."

2. AI's Explosive Growth and 2025 Status

Grace looks back at AI's explosive growth since Stable Diffusion in August 2022. She evaluates the past 18 months' advances as "more aggressive, more impressive, and more widespread."

Key AI events of 2025:
- $500 billion Stargate Project: Collaboration between the US government, OpenAI, SoftBank, and Oracle.
- OpenAI's GPT-4.03: Surpassing human performance and achieving an AGI challenge.
- DeepSeek's R1 Model: Causing such impact that it affected Nvidia's stock price.
- France AI Summit: President Macron announcing European AI initiatives, re-engaging Europe in the AI race.

"2025 is the moment for AI agents. It's the perfect storm, but the lightning hasn't struck yet."

3. Current Limitations of AI Agents

Real case: Flight booking failure
- Grace asked OpenAI Operator to book a flight from New York to San Francisco but it failed multiple times.
- First attempt: Couldn't find flights on Kayak.
- Second attempt: Recommended rush-hour flights on Skyscanner.
- It failed to properly reflect personal preferences (seat position, airline, price, etc.).

"AI fails because small cumulative errors compound. This goes beyond simple 'hallucination' problems."

4. Major Error Types in AI Agents

Grace classifies the main errors AI agents face into four types.

Decision Error: Choosing wrong facts.
- Example: Booking San Francisco, Peru instead of San Francisco, California.
Implementation Error: Wrong approach or integration issues.
- Example: Data flow disruption due to CAPTCHA problems.
Heuristic Error: Applying wrong criteria.
- Example: Not considering JFK airport traffic conditions.
Taste Error: Misunderstanding personal preferences.
- Example: Booking a Boeing 737 Max aircraft that Grace dislikes.

"AI does magical things, but small mistakes compound and fail to meet human expectations."

5. The Impact of Compounding Errors

Grace visually explains how compounding errors affect AI agent performance.

Comparing 99% vs. 95% accurate agents:
- After 50 steps, a 99% accurate agent drops to 60% accuracy.
- Compounding errors increase complexity even in simple tasks.

"Even a simple flight booking can become extremely complex due to compounding errors."

6. Five Strategies for Building AI Agents

Grace proposes five strategies to reduce compounding errors and improve AI agents.

Data Curation:
- Manage data quality to ensure AI agents have the information they need.
- "Data is the best asset, and curation makes it more effective."
- Example: Collecting Grace's travel preference data and reflecting it in real-time.
Evaluation Systems (Evals):
- Measure model responses and select correct answers.
- "Setting evaluation criteria for non-verifiable systems (e.g., personal preferences) is crucial."
Scaffolding Systems:
- Prevent a single error from affecting the entire system.
- Example: Ramp's infrastructure logic designed to prevent error propagation.
User Experience (UX):
- Improve UX to make AI agents better collaborators.
- "UX is the differentiating factor for AI apps. It should promote beautiful, elegant human-machine collaboration."
Multimodal Design:
- Integrate new modalities beyond text — voice, images, smell, touch.
- "We need to think beyond chatbot interfaces about making AI more human."

7. Conclusion: The Future of AI Agents

Grace emphasizes that while AI agents aren't yet perfect, they can be improved through data curation, evaluation, scaffolding, UX, and multimodal design.

"We're in the perfect storm for AI agents, but the lightning hasn't struck yet. I look forward to the innovative AI products you'll create."

1. Lux Capital and the AI Frontier

2. AI's Explosive Growth and 2025 Status

3. Current Limitations of AI Agents

4. Major Error Types in AI Agents

5. The Impact of Compounding Errors

6. Five Strategies for Building AI Agents

7. Conclusion: The Future of AI Agents

Related writing

The journey of AI tokens in the data center 🚀

Introducing TabFM: A zero-shot foundation model for…

Inside YC's AI Playbook

Reading

1. Lux Capital and the AI Frontier

2. AI's Explosive Growth and 2025 Status

3. Current Limitations of AI Agents

4. Major Error Types in AI Agents

5. The Impact of Compounding Errors

6. Five Strategies for Building AI Agents

7. Conclusion: The Future of AI Agents

Related writing

The journey of AI tokens in the data center 🚀

Introducing TabFM: A zero-shot foundation model for…

Inside YC's AI Playbook