Collinear Research

Explore our research papers developed in partnership with leading enterprises and universities

Assess, Guard, and Improve Your AI Systems with Security and Performance

We gave Claude, Gemini and GPT, $250k, and it didn't go as you’d expect...

5 min read
February 27, 2026

Spider: A lightweight on/off-policy distillation framework with a single client interface.

5 min read
November 19, 2025

The Valley of Code Reasoning: Scaling Knowledge Distillation of Large Language Models

8 min read
November 6, 2025

Impatient users confuse AI agents: high-fidelity simulations of human traits for testing agents

12 min read
September 29, 2025

Cats Confuse Reasoning LLM: Query Agnostic Adversarial Triggers for Reasoning Models

12 min read
March 3, 2025

VERITAS: A Unified Approach to Reliability Evaluation

10 min read
November 4, 2024

Self-rationalization improves LLM as a fine-grained judge

10 min read
October 7, 2024

Better environments.
Better data. Better agents.

See what a thousand rollouts can teach your agent in 30 minutes.