arxiv:2605.20552
Michal Valko
AI & ML interests
large language models, reasoning, fine-tuning, test-time computation, reinforcement learning with human feedback, world models
Recent Activity
authored a paper 3 days ago
Spectral bandits for smooth graph functions with applications in recommender systems updated a dataset 3 days ago
misovalko/my-research-papers authored a paper about 1 month ago
Budgeted Online Influence Maximization