ArXiv
51 mentions across all digests
ArXiv is a preprint server that hosts and distributes research papers in computer science and related fields, including work on content fuzzing, named entity recognition for low-resource languages, and LLM applications in enterprise and healthcare.
The Power of Power Law: Asymmetry Enables Compositional Reasoning
Power-law asymmetry in neural network structures is presented as fundamental to compositional reasoning, offering an explanation for how AI models combine simple concepts into complex multi-step reasoning.
From Coarse to Fine: Self-Adaptive Hierarchical Planning for LLM Agents
Researchers introduce self-adaptive hierarchical planning for LLM agents, which refines action plans from coarse to fine to improve reasoning on complex tasks.
The Spectral Lifecycle of Transformer Training: Transient Compression Waves, Persistent Spectral Gradients, and the Q/K–V Asymmetry
Transient compression waves and persistent power-law spectral gradients propagate systematically through transformer layers during pretraining, revealing fundamental asymmetries between attention projection types that scale consistently from 30M to 285M parameters.
Stochastic KV Routing: Enabling Adaptive Depth-Wise Cache Sharing
Stochastic KV routing cuts transformer inference memory overhead by dynamically sharing key-value caches across layers, enabling leaner LLM deployment without sacrificing quality.
Adaptive Test-Time Compute Allocation with Evolving In-Context Demonstrations
Researchers propose dynamically evolving in-context demonstrations during inference to adaptively allocate test-time compute and improve model efficiency without retraining.