Method predicts LLM output lengths with uncertainty quantification to improve inference scheduling efficiency, accepted at AISTATS 2026. Tackles variable-length generation bottleneck impacting production throughput and cost. Systems-level contribution for scaling serving infrastructure.
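The paper's actual predictor isn't described here, but the scheduling idea it enables can be sketched: order requests by a pessimistic (upper-quantile) predicted output length, so long generations don't block short ones. The `toy_predict` function below is a hypothetical stand-in for a learned length model.

```python
import heapq

def schedule_by_predicted_length(requests, predict_quantiles):
    """Order requests shortest-predicted-first using an upper-quantile
    length estimate. `predict_quantiles(prompt)` returns (p50, p90)
    predicted output-token counts; scheduling by p90 hedges against
    under-prediction, which causes head-of-line blocking."""
    heap = [(predict_quantiles(r)[1], i, r) for i, r in enumerate(requests)]
    heapq.heapify(heap)
    order = []
    while heap:
        _, _, req = heapq.heappop(heap)
        order.append(req)
    return order

# Hypothetical predictor: pretend output length scales with prompt length.
toy_predict = lambda prompt: (len(prompt), 2 * len(prompt))
print(schedule_by_predicted_length(["summarize this long doc", "hi"], toy_predict))
# → ['hi', 'summarize this long doc']
```

The uncertainty quantification is what makes this safe to deploy: a point estimate that undershoots is worse for tail latency than a calibrated upper bound.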
Simon Willison discusses headless deployment patterns for personal AI applications. Explores infrastructure approaches for running AI systems without graphical interfaces. Practical guide for self-hosted AI setups.
Uber CTO reports budget constraints limiting AI initiatives despite $3.4B spend. Signals potential cooling in enterprise AI investment even at major tech companies.
Gemma 4's release exposed systemic reliability issues: local model runners (Ollama, LM Studio) rushed launch-day support with broken tokenizer implementations and failed tool calls. Discussion weighed trade-offs between inference tools, with benchmarks showing Ollama roughly 25% faster than LM Studio on Mac, but flagged a recurring pattern of premature launch-day releases creating production issues.
Mixed precision and floating-point settings cause ~2.4× training time variation in distributed deep learning, but existing predictors ignore precision and incur up to 147.85% MAPE. This work proposes a precision-aware predictor that accounts for mixed precision configurations to accurately forecast distributed training times for resource allocation and scheduling.
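A minimal sketch of why precision must appear in the model, not the paper's actual predictor: if per-step time is compute divided by effective throughput plus communication, and effective throughput depends on the numeric format, a precision-blind predictor is wrong by roughly the precision speedup factor. The speedup ratios below are assumed for illustration.

```python
# Assumed throughput multipliers relative to fp32 (illustrative only).
PRECISION_SPEEDUP = {"fp32": 1.0, "fp16": 2.2, "bf16": 2.1}

def predict_step_time(flops, base_flops_per_s, comm_s, precision="fp32"):
    """Per-step time = compute / effective throughput + communication.
    The effective throughput scales with the precision in use."""
    speedup = PRECISION_SPEEDUP[precision]
    return flops / (base_flops_per_s * speedup) + comm_s

fp32 = predict_step_time(1e12, 1e11, 0.5, "fp32")
fp16 = predict_step_time(1e12, 1e11, 0.5, "fp16")
print(round(fp32 / fp16, 2))  # ≈2x gap a precision-blind predictor misses
```

The fixed communication term is why the observed gap (~2.1x here) is smaller than the raw compute speedup (2.2x), matching the paper's point that the interaction is configuration-dependent rather than a constant factor.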
Probabilistic Synchronous Parallel (PSP) in federated learning assumes static, independent device behavior, causing unfair synchronization when device availability correlates with data distribution. Proposes robust synchronization methods to handle correlated device failures from mobility, power constraints, and user activity in edge deployments.
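The fairness failure is easy to see in a toy simulation (hypothetical numbers, not from the paper): when availability probability correlates with which data a device holds, some data classes are systematically underrepresented at synchronization points.

```python
import random

def participation_rates(rounds, p_participate):
    """Simulate which devices join each sync round. `p_participate[i]` is
    device i's availability probability; correlating availability with
    data class skews which data the aggregate ever sees."""
    random.seed(0)
    counts = [0] * len(p_participate)
    for _ in range(rounds):
        for i, p in enumerate(p_participate):
            if random.random() < p:
                counts[i] += 1
    return [c / rounds for c in counts]

# Hypothetical: devices holding class-A data are plugged in (p=0.9),
# class-B devices are mobile and often offline (p=0.3).
rates = participation_rates(10_000, [0.9, 0.9, 0.3, 0.3])
print([round(r, 1) for r in rates])  # class-B data is underrepresented ~3x
```

Under PSP's static-independence assumption the aggregator cannot distinguish this from benign randomness, which is the gap the proposed robust synchronization methods target.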
Prism is the first symbolic superoptimizer for tensor programs, using sGraph representation to symbolically encode operator families and execution parameters. Two-level search with symbolic pruning and e-graph verification achieves provably optimal kernels across large search spaces.
📑 arXiv, 3d ago · ★ High Signal: Scepsy is a serving system for multi-LLM agentic workflows that schedules arbitrary agent frameworks onto GPU clusters under oversubscription. It exploits the observation that while end-to-end workflow latencies are unpredictable, the relative execution time shares of each LLM remain stable across runs. Enables efficient serving of complex agentic workflows at target throughput with low latency.
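The core observation is simple to operationalize; a minimal sketch (not Scepsy's implementation) estimates each LLM's share of end-to-end time from past runs, which a scheduler could then use to size per-model GPU allocations even when absolute latencies drift.

```python
def estimate_shares(runs):
    """Average each LLM's share of end-to-end time across past runs.

    `runs` is a list of {llm_name: seconds} dicts. Absolute latencies
    vary run to run, but relative shares are assumed stable (the
    observation the system exploits)."""
    totals = {}
    for run in runs:
        run_total = sum(run.values())
        for llm, t in run.items():
            totals.setdefault(llm, []).append(t / run_total)
    return {llm: sum(s) / len(s) for llm, s in totals.items()}

# Second run is 2x slower end-to-end, yet the shares are identical.
runs = [{"planner": 6.0, "coder": 3.0, "critic": 1.0},
        {"planner": 12.0, "coder": 6.0, "critic": 2.0}]
print(estimate_shares(runs))  # → {'planner': 0.6, 'coder': 0.3, 'critic': 0.1}
```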
CoGrid is a multi-agent grid simulation library with NumPy and JAX backends, paired with Multi-User Gymnasium (MUG) that converts simulations into interactive web experiments. The tools lower barriers for researchers studying human-AI interaction by supporting arbitrary numbers of humans and AI agents in both server-authoritative and peer-to-peer modes.
Autogenesis Protocol (AGP) standardizes self-evolving agent systems by modeling prompts, agents, tools, environments, and memory as protocol-registered resources with lifecycle management and version tracking. The Resource Substrate Protocol Layer decouples what evolves from how evolution occurs, addressing brittleness in existing protocols like A2A and MCP.
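The "protocol-registered resources with lifecycle management and version tracking" idea can be illustrated with a toy registry (a sketch of the concept, not AGP's actual schema or API): every evolution step produces a new tracked version while the history of prior states is retained.

```python
from dataclasses import dataclass, field

@dataclass
class Resource:
    kind: str      # e.g. "prompt" | "agent" | "tool" | "environment" | "memory"
    name: str
    body: str
    version: int = 1
    history: list = field(default_factory=list)

class Registry:
    """Toy registry: evolution is decoupled from the resource itself;
    each change is a tracked, reversible version bump."""
    def __init__(self):
        self._resources = {}

    def register(self, kind, name, body):
        self._resources[(kind, name)] = Resource(kind, name, body)

    def evolve(self, kind, name, new_body):
        r = self._resources[(kind, name)]
        r.history.append((r.version, r.body))
        r.body, r.version = new_body, r.version + 1
        return r.version

reg = Registry()
reg.register("prompt", "planner", "You are a planner.")
print(reg.evolve("prompt", "planner", "You are a careful planner."))  # → 2
```

Separating the substrate (what is registered and versioned) from the evolution mechanism (who calls `evolve`, and why) is the decoupling the protocol layer aims at.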
Discussion analyzing whether AI agent operational costs are experiencing exponential growth similar to training costs. Examines infrastructure and inference expenses for agentic systems at scale. Raises concerns about economic sustainability of agent-based architectures.
DGX Spark owner seeks advice on configuring vLLM with PyTorch and Hugging Face models for local inference in education/analytics use case. First on-prem deployment after cloud GPU experience, asking for model recommendations and vLLM tuning tips for unified memory systems. Community discussion of practical deployment considerations.
Source-available AI gateway from 35m.ai supporting unified access to text, image, video, audio, and music generation APIs with intelligent multi-provider routing and hybrid BYOK (bring-your-own-key) workflows. Optimizes compute utilization across heterogeneous provider backends.
Google's Agent-to-Agent Protocol reached 150+ organizations and production deployments in Azure AI Foundry and Amazon Bedrock AgentCore at its one-year milestone. v1.0 added Signed Agent Cards for cryptographic identity verification between agents; combined with IBM's merged Agent Communication Protocol and the AP2 commerce extension, it now covers the full lifecycle from tool access to delegation to payments.
AI inference queries consume massive energy, GPU hardware lifecycles are 2-3 years, and sustainability costs remain hidden from users, creating major environmental challenges. QCon London talk proposes model compression, quantization, and novel architectures as solutions, arguing sustainability must be a design constraint not an afterthought.
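Quantization's sustainability argument is arithmetic: fewer bytes per weight means less memory traffic and smaller hardware per query. A minimal symmetric int8 sketch (illustrative, not the talk's method) shows the 4x storage reduction directly.

```python
import struct

def quantize_int8(weights):
    """Symmetric int8 quantization: map floats into [-127, 127] using a
    single scale, shrinking 4 bytes/weight to 1 byte/weight."""
    scale = max(abs(w) for w in weights) / 127.0
    q = [round(w / scale) for w in weights]
    return q, scale

def dequantize(q, scale):
    return [v * scale for v in q]

w = [0.5, -1.27, 0.02, 1.0]
q, s = quantize_int8(w)
approx = dequantize(q, s)
fp32_bytes = len(w) * struct.calcsize("f")
int8_bytes = len(q)  # one byte per weight
print(fp32_bytes, int8_bytes)  # → 16 4
```

The same 4x factor compounds at data-center scale, which is why the talk treats it as a design constraint rather than a post-hoc optimization.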
NVIDIA releases Nemotron models and datasets to support systems R&D and sell GPUs, organizing 500+ people with "invitation, not control" philosophy. One of few economically coherent open model strategies: understand customer needs and drive hardware sales. Explains evolution from Megatron to modern open releases.
ddtree-mlx ports tree-based speculative decoding to Apple Silicon with custom Metal kernels, achieving 10-15% speedup over DFlash on code and 1.5x over autoregressive inference. First MLX implementation supporting hybrid model architectures.
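ddtree-mlx's tree variant and Metal kernels aren't reproduced here, but the underlying speculative-decoding acceptance rule (the simpler chain case, with greedy verification) can be sketched: accept the longest draft prefix the target model agrees with, then fall back to the target's own token.

```python
def verify_draft(draft_tokens, target_next_token):
    """Greedy chain-case speculative decoding: accept draft tokens until
    the target model disagrees, then emit the target's token instead.

    `target_next_token(prefix)` is a hypothetical stand-in for a real
    target-model forward pass."""
    accepted = []
    for tok in draft_tokens:
        expected = target_next_token(accepted)
        if tok != expected:
            return accepted + [expected]  # first disagreement wins
        accepted.append(tok)
    return accepted + [target_next_token(accepted)]

# Hypothetical target model that always continues with len(prefix).
target = lambda prefix: len(prefix)
print(verify_draft([0, 1, 5], target))  # → [0, 1, 2]: two drafts accepted
```

The tree variant generalizes this by verifying several candidate continuations in one batched target pass, which is where the custom kernels earn their speedup.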
Hardware build consolidates two RTX 6000 Ada GPUs (96GB GDDR6 each, 192GB total VRAM) into single Threadripper PRO 7965WX workstation with 256GB DDR5 ECC and dual 1600W Titanium PSUs. Targets local LLM training and inference at scale with 128 PCIe 5.0 lanes supporting x16/x16 GPU configuration. Community build documentation for high-end ML workstations.
Asynkor provides file leasing coordination for AI agent teams via MCP server, preventing merge conflicts when multiple agents edit code. Works across IDEs without changing agent implementations.
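The real Asynkor server speaks MCP; the lease semantics it coordinates can be sketched with a toy in-memory table (assumed behavior, not Asynkor's API): a lease with a TTL blocks other agents until it is released or expires.

```python
import time

class LeaseTable:
    """Toy file-lease table illustrating acquire/release/expire semantics
    for multi-agent editing (not the actual Asynkor implementation)."""
    def __init__(self, ttl=30.0):
        self.ttl = ttl
        self._leases = {}  # path -> (agent_id, expires_at)

    def acquire(self, path, agent_id, now=None):
        now = time.monotonic() if now is None else now
        holder = self._leases.get(path)
        if holder and holder[1] > now and holder[0] != agent_id:
            return False  # another agent holds a live lease
        self._leases[path] = (agent_id, now + self.ttl)
        return True

    def release(self, path, agent_id):
        if self._leases.get(path, (None,))[0] == agent_id:
            del self._leases[path]

table = LeaseTable(ttl=30.0)
print(table.acquire("src/app.py", "agent-a", now=0.0))   # → True
print(table.acquire("src/app.py", "agent-b", now=5.0))   # → False: lease live
print(table.acquire("src/app.py", "agent-b", now=31.0))  # → True: lease expired
```

TTL-based expiry is what makes this safe without changing agent implementations: a crashed agent's lease simply lapses instead of deadlocking the team.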
Cloudflare integrates OpenAI's GPT-5.4 and Codex into Agent Cloud, enabling enterprises to build and deploy AI agents at scale. The partnership combines Cloudflare's infrastructure with OpenAI's latest models for production agentic workflows.
Interview with Sebastian Raschka covering 2026 AI architecture evolution, post-training to hybrid models, and Process Reward Models as the next frontier. Discusses his minimal AI stack (Mac mini, Codex, Ollama), fine-tuning as economic decision, and layer-by-layer verification philosophy for his upcoming book 'Build a Reasoning Model from Scratch.'
llama.cpp released critical fixes for Gemma 4's KV cache implementation that was consuming excessive VRAM, significantly reducing memory footprint. Community members successfully deployed Gemma 4 26B with 4-bit quantization on Rockchip NPU at 4W power consumption.
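Why KV cache bugs dominate VRAM is a back-of-envelope calculation: per-sequence cache size is 2 (K and V) x layers x KV heads x head dim x sequence length x bytes per element. The config numbers below are illustrative, not Gemma 4's actual architecture.

```python
def kv_cache_bytes(layers, kv_heads, head_dim, seq_len, dtype_bytes=2):
    """Per-sequence KV cache footprint: K and V tensors for every layer,
    each [kv_heads, seq_len, head_dim] at dtype_bytes per element."""
    return 2 * layers * kv_heads * head_dim * seq_len * dtype_bytes

# Hypothetical mid-size config: 48 layers, 8 KV heads, head dim 128,
# 8K context, fp16 cache entries.
gib = kv_cache_bytes(layers=48, kv_heads=8, head_dim=128,
                     seq_len=8192, dtype_bytes=2) / 2**30
print(round(gib, 2))  # → 1.5 (GiB per sequence)
```

At these sizes an over-allocation bug multiplies across concurrent sequences, which is why a cache fix alone can significantly reduce total memory footprint.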
Simon Willison predicts 2026 as inflection point where LLM code quality becomes undeniable, driven by reasoning models trained with RL specifically for code. Also forecasts 2026 as year of solving code sandboxing via containers and WebAssembly, addressing security risks and prompt injection vulnerabilities from executing untrusted LLM-generated code. Critical for safe agentic workflows.