News
Talks
Ep. 45: John Yang, SWE-Bench Lead Author and Stanford CS PhD Student
2025-11-17 • Delta Institute | Ankit Gupta
2025-11-17 • Delta Institute | Ankit Gupta
AI Evals w: John Yang: Evaluating and training software engineering agents
2025-10-17 • alphaXiv x Vals AI
2025-10-17 • alphaXiv x Vals AI
SWE-smith: Scaling Data for Software Engineering Agents | John Yang | Stanford University
2025-08-12 • Open AGI Summit
2025-08-12 • Open AGI Summit
Few Shot Code Generation to Autonomous Software Engineering Agents // John Yang
2024-12-02 • MLOps.community
2024-12-02 • MLOps.community
SWE-bench with John Yang and Carlos E. Jimenez - Weaviate Podcast #107!
2024-10-30 • Weaviate Podcast | Connor Shorten
2024-10-30 • Weaviate Podcast | Connor Shorten
John Yang - SWE-bench: Can Language Models Resolve Real-World GitHub Issues?
2023-11-03 • University of Toronto | Rohan Alexander
2023-11-03 • University of Toronto | Rohan Alexander
Press
Laude Institute Announces First Batch of Slingshots AI Grants
2025-11-06 • TechCrunch | Russell Brandom
2025-11-06 • TechCrunch | Russell Brandom
Stanford and Alibaba Build Bug-Fixing Dataset and Pipeline to Train AI
2025-08-13 • DeepLearning.AI | Andrew Ng
2025-08-13 • DeepLearning.AI | Andrew Ng
A new AI coding challenge just published its first results — and they aren't pretty
2025-07-23 • TechCrunch | Russell Brandom
2025-07-23 • TechCrunch | Russell Brandom
AI Models Still Struggle to Debug Software, Microsoft Study Shows
2025-04-10 • TechCrunch | Kyle Wiggers
2025-04-10 • TechCrunch | Kyle Wiggers
Coding Agents Are Evolving From Novelties to Widely Useful Tools
2024-06-19 • DeepLearning.AI | Andrew Ng
2024-06-19 • DeepLearning.AI | Andrew Ng
AI Agent Automatically Codes WITH TOOLS - SWE-Agent Tutorial ("Devin Clone")
2024-04-05 • Matthew Berman
2024-04-05 • Matthew Berman