Александр Петров

tmp-123

AI & ML interests

None yet

Recent Activity

liked a dataset 14 minutes ago

chaitanya-yadav/vehicle-predictive-maintenance

upvoted a paper 1 day ago

Learning to Learn-at-Test-Time: Language Agents with Learnable Adaptation Policies

liked a dataset 2 days ago

MHuangX/LAION-Beyond

Organizations

None yet

liked a dataset 14 minutes ago

chaitanya-yadav/vehicle-predictive-maintenance

Updated 28 minutes ago

upvoted a paper 1 day ago

Learning to Learn-at-Test-Time: Language Agents with Learnable Adaptation Policies

Paper • 2604.00830 • Published 9 days ago • 12

liked a dataset 2 days ago

MHuangX/LAION-Beyond

Preview • Updated about 14 hours ago • 7.84k

upvoted a paper 3 days ago

GrandCode: Achieving Grandmaster Level in Competitive Programming via Agentic Reinforcement Learning

Paper • 2604.02721 • Published 8 days ago • 248

upvoted a paper 6 days ago

On Token's Dilemma: Dynamic MoE with Drift-Aware Token Assignment for Continual Learning of Large Vision Language Models

Paper • 2603.27481 • Published 12 days ago • 35

liked a dataset 7 days ago

legacy-datasets/wikipedia

Updated Mar 11, 2024 • 84.9k • 618

upvoted a paper 9 days ago

CARLA-Air: Fly Drones Inside a CARLA World -- A Unified Infrastructure for Air-Ground Embodied Intelligence

Paper • 2603.28032 • Published 11 days ago • 339

liked a model 9 days ago

mulemp/kcworld

Updated 4 days ago

upvoted 2 papers 10 days ago

SpecEyes: Accelerating Agentic Multimodal LLMs via Speculative Perception and Planning

Paper • 2603.23483 • Published 17 days ago • 61

FIPO: Eliciting Deep Reasoning with Future-KL Influenced Policy Optimization

Paper • 2603.19835 • Published 21 days ago • 330

upvoted a paper 14 days ago

PixelSmile: Toward Fine-Grained Facial Expression Editing

Paper • 2603.25728 • Published 15 days ago • 117

upvoted a paper 23 days ago

HSImul3R: Physics-in-the-Loop Reconstruction of Simulation-Ready Human-Scene Interactions

Paper • 2603.15612 • Published 25 days ago • 152

upvoted a paper 26 days ago

Bootstrapping Exploration with Group-Level Natural Language Feedback in Reinforcement Learning

Paper • 2603.04597 • Published Mar 4 • 210

liked 2 models about 1 month ago

zai-org/GLM-5

Text Generation • 754B • Updated 5 days ago • 392k • 1.97k

deepseek-ai/DeepSeek-R1

Text Generation • 685B • Updated Mar 27, 2025 • 3.34M • 13.2k

upvoted 5 papers about 2 months ago

A Very Big Video Reasoning Suite

Paper • 2602.20159 • Published Feb 23 • 519

VESPO: Variational Sequence-Level Soft Policy Optimization for Stable Off-Policy LLM Training

Paper • 2602.10693 • Published Feb 11 • 220

SQuTR: A Robustness Benchmark for Spoken Query to Text Retrieval under Acoustic Noise

Paper • 2602.12783 • Published Feb 13 • 216

TermiGen: High-Fidelity Environment and Robust Trajectory Synthesis for Terminal Agents

Paper • 2602.07274 • Published Feb 6 • 208

Less is Enough: Synthesizing Diverse Data in Feature Space of LLMs

Paper • 2602.10388 • Published Feb 11 • 244
