Spec Kit Agents: Context-Grounded Agentic Workflows

Pardis Taghavi, Santosh Bhavani
4/7/2026
cs.SE, cs.AI, cs.MA

Abstract

Spec-driven development (SDD) with AI coding agents provides a structured workflow, but agents often remain "context blind" in large, evolving repositories, leading to hallucinated APIs and architectural violations. We present Spec Kit Agents, a multi-agent SDD pipeline (with PM and developer roles) that adds phase-level, context-grounding hooks. Read-only probing hooks ground each stage (Specify, Plan, Tasks, Implement) in repository evidence, while validation hooks check intermediate artifacts against the environment. We evaluate 128 runs covering 32 features across five repositories. Context-grounding hooks improve judged quality by +0.15 on a 1-5 composite LLM-as-judge score (+3.0 percent of the full score; Wilcoxon signed-rank, p < 0.05) while maintaining 99.7-100 percent repository-level test compatibility. We further evaluate the framework on SWE-bench Lite, where augmentation hooks improve on the baseline by 1.7 percent, achieving 58.2 percent Pass@1.
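The phase-level hook structure described in the abstract can be pictured with a minimal sketch. This is not the authors' implementation: the names `run_pipeline`, `PhaseResult`, and the three toy hooks are hypothetical, and stopping at the first failed validation is an assumption made only for illustration. Only the four stage names and the split between read-only probing hooks and validation hooks come from the abstract.

```python
from dataclasses import dataclass
from typing import Callable, List

# Stage names taken from the abstract; everything else is hypothetical.
PHASES = ["Specify", "Plan", "Tasks", "Implement"]


@dataclass
class PhaseResult:
    phase: str
    evidence: List[str]  # repository facts gathered by the probing hook
    artifact: str        # intermediate artifact produced for this phase
    valid: bool          # outcome of the validation hook


def run_pipeline(
    probe: Callable[[str], List[str]],
    produce: Callable[[str, List[str]], str],
    validate: Callable[[str, str], bool],
) -> List[PhaseResult]:
    """Run each phase as: probe (read-only), produce, validate."""
    results = []
    for phase in PHASES:
        evidence = probe(phase)              # read-only probing hook
        artifact = produce(phase, evidence)  # agent step grounded in the evidence
        ok = validate(phase, artifact)       # validation hook
        results.append(PhaseResult(phase, evidence, artifact, ok))
        if not ok:
            break  # assumption: halt at the first failed validation
    return results


# Toy hooks standing in for real repository probes and agent calls.
def toy_probe(phase: str) -> List[str]:
    return [f"evidence for {phase}"]


def toy_produce(phase: str, evidence: List[str]) -> str:
    return f"{phase} artifact based on {len(evidence)} evidence item(s)"


def toy_validate(phase: str, artifact: str) -> bool:
    return artifact.startswith(phase)


if __name__ == "__main__":
    for result in run_pipeline(toy_probe, toy_produce, toy_validate):
        print(result.phase, result.valid)
```

In this sketch the hooks are plain callables, so a probing hook can be swapped for anything that reads the repository without writing to it, and a validation hook for any check of an artifact against the environment.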


Code Implementations (9)

This repository is the Developer Kit for Claude Code - a modular plugin system providing reusable skills, agents, and commands for automating development tasks. The kit includes independent plugins covering Java/Spring Boot/LangChain4J, TypeScript/NestJS/React, Python, PHP/WordPress, AWS CloudFormation, and AI patterns. Designed for multi-CLI.

1188 | Oct 21, 2025 | 1 month ago | MIT
agentic-code, agentic-coding, agents, aws, aws-cloudformation, +12 more

Production-ready Next.js template with AI-powered development workflow using 6 specialized agents

233633 | Jan 12, 2026 | 2 weeks ago

Open-source Agentic AI framework in Go for building, orchestrating, and deploying intelligent agents. LLM-agnostic, event-driven, with multi-agent workflows, MCP tool discovery, and production-grade observability.

13125 | May 2, 2025 | 1 week ago | Apache-2.0
agentic-ai, agentic-ai-development, agentic-coding, agentic-framework, agentic-rag, +13 more

Modular framework for autonomous AI agents that interact with blockchain protocols, execute transactions, and coordinate multi-agent workflows. Supports EVM (Base, Ethereum L2s) and Solana.

122121 | Oct 9, 2025 | 1 month ago
agent, ai, blockchain, kit, on-chain

Production-tested templates for deploying multi-agent AI teams on OpenClaw with Telegram supergroup integration. 10 agent personalities, shared context workflows, bot-to-bot communication, and step-by-step AI-readable setup instructions. Built from a live 10-agent setup.

31368 | Mar 4, 2026 | 2 weeks ago | MIT

Agentic RAG vector starter kit with AI coding agent architecture, Langchain, LanceDB, and Backblaze B2 for document upload, semantic search, grounded chat, and document citations.

01 | Mar 10, 2026 | 3 weeks ago | MIT
agentic-ai, agentic-coding, agentic-framework, agentic-rag, agentic-workflow, +9 more

MSc thesis project: An HR virtual assistant using Google's Agent Development Kit (ADK), with modular agent architecture and Vertex AI Search for secure, document-grounded responses.

00 | May 27, 2025 | 9 months ago

A web-based Flask app built using Google Agent Development Kit (ADK) that deploys two AI agents: a NuGenomics Customer Support Agent grounded in official FAQs using Retrieval-Augmented Generation (RAG), and a Genetic Wellness Agent providing general wellness insights via LLMs. Built for the ADK Agent Development Challenge, October 2025.

00 | Oct 31, 2025 | 5 months ago

Getting images with annotations, manually setting up agent's categories and creating CSV files with camera and objects ground truth, global coordinate data.

10 | Sep 13, 2024 | 1 year ago
