stuff i never have time to read - a mattsta Collection

mattsta 's Collections

stuff i never have time to read

stuff i never have time to read

updated about 3 hours ago

CLIMB: CLustering-based Iterative Data Mixture Bootstrapping for Language Model Pre-training

Paper • 2504.13161 • Published Apr 17, 2025 • 98
Hebbian Learning based Orthogonal Projection for Continual Learning of Spiking Neural Networks

Paper • 2402.11984 • Published Feb 19, 2024
BlackGoose Rimer: Harnessing RWKV-7 as a Simple yet Superior Replacement for Transformers in Large-Scale Time Series Modeling

Paper • 2503.06121 • Published Mar 8, 2025 • 5
Timer: Transformers for Time Series Analysis at Scale

Paper • 2402.02368 • Published Feb 4, 2024 • 2
Timer-XL: Long-Context Transformers for Unified Time Series Forecasting

Paper • 2410.04803 • Published Oct 7, 2024 • 2
Time-MoE: Billion-Scale Time Series Foundation Models with Mixture of Experts

Paper • 2409.16040 • Published Sep 24, 2024 • 16
Packing Input Frame Context in Next-Frame Prediction Models for Video Generation

Paper • 2504.12626 • Published Apr 17, 2025 • 51
One RL to See Them All: Visual Triple Unified Reinforcement Learning

Paper • 2505.18129 • Published May 23, 2025 • 63
Ming-Omni: A Unified Multimodal Model for Perception and Generation

Paper • 2506.09344 • Published Jun 11, 2025 • 33
APE: Faster and Longer Context-Augmented Generation via Adaptive Parallel Encoding

Paper • 2502.05431 • Published Feb 8, 2025 • 6
Gemini 2.5: Pushing the Frontier with Advanced Reasoning, Multimodality, Long Context, and Next Generation Agentic Capabilities

Paper • 2507.06261 • Published Jul 7, 2025 • 68
UI-TARS-2 Technical Report: Advancing GUI Agent with Multi-Turn Reinforcement Learning

Paper • 2509.02544 • Published Sep 2, 2025 • 128
Qwen3-ASR Technical Report

Paper • 2601.21337 • Published Jan 29 • 38
Mercury: Ultra-Fast Language Models Based on Diffusion

Paper • 2506.17298 • Published Jun 17, 2025 • 11