We present Operand Quant, a single-agent, IDE-based architecture for autonomous machine learning engineering (MLE). Operand Quant departs from conventional multi-agent orchestration frameworks by consolidating all MLE lifecycle stages -- exploration, modeling, experimentation, and deployment -- within a single, context-aware agent. On the MLE-Benchmark (2025), Operand Quant achieved a new state-of-the-art (SOTA) result, with an overall medal rate of 0.3956 +/- 0.0565 across 75 problems -- the highest recorded performance among all evaluated systems to date. The architecture demonstrates that a linear, non-blocking agent, operating autonomously within a controlled IDE environment, can outperform multi-agent and orchestrated systems under identical constraints.
翻译:本文提出操作数量化(Operand Quant),一种基于集成开发环境(IDE)的单智能体架构,用于自主机器学习工程(MLE)。该架构摒弃了传统的多智能体编排框架,将机器学习工程全生命周期——探索、建模、实验与部署——整合至一个具备上下文感知能力的单一智能体中。在MLE基准测试(2025)中,操作数量化取得了新的最优性能,在75个问题上总体奖牌率达到0.3956 ± 0.0565,这是迄今为止所有已评估系统中记录的最高性能。该架构证明,在受控的IDE环境中自主运行的线性非阻塞智能体,能够在相同约束条件下超越多智能体及编排系统。