Daniel van Strien's picture

Building on HF

Daniel van Strien PRO

davanstrien

huggingface

·

https://danielvanstrien.xyz/

AI & ML interests

Machine Learning Librarian

Recent Activity

updated a Space about 8 hours ago

davanstrien/benchmark-race

updated a dataset about 13 hours ago

librarian-bots/model_cards_with_metadata

updated a dataset about 14 hours ago

librarian-bots/dataset-columns

View all activity

Organizations

upvoted an article 1 day ago

Article

MTEB Leaderboard: From a slow demo to feature-rich leaderboard

Samoed

•

1 day ago

• 18

upvoted 2 articles 3 days ago

Article

Any Custom Frontend with Gradio's Backend

ysharma, abidlabs

•

Apr 1

• 37

Article

Migrating Your GitHub CI to Hugging Face Jobs

abidlabs

•

5 days ago

• 8

upvoted a paper 7 days ago

Harness-1: Reinforcement Learning for Search Agents with State-Externalizing Harnesses

Paper • 2606.02373 • Published 13 days ago • 52

upvoted an article 9 days ago

Article

Designing the hf CLI as an agent-optimized way to work with the Hub

celinah, Wauplin

•

10 days ago

• 56

upvoted a paper 10 days ago

Less Is More? When Dataset Context Hurts LLM-Generated Dataset Descriptions

Paper • 2606.02334 • Published 13 days ago • 1

upvoted an article 10 days ago

Article

Shipping a Trillion Parameters With a Hub Bucket: Delta Weight Sync in TRL

+6

aminediroHF, qgallouedec, kashif, lewtun, edbeeching, albertvillanova, lvwerra, sergiopaniego

•

18 days ago

• 41

upvoted an article 12 days ago

Article

Beyond LLMs: Why Scalable Enterprise AI Adoption Depends on Agent Logic

ibm-research

•

12 days ago

• 85

upvoted an article 16 days ago

Article

MONET: Lowering the Barrier to World Class Image Generation Research

jasperai

•

16 days ago

• 10

upvoted a collection 16 days ago

MONET - Massive Open Non-redundant, Enriched, Text-to-image

A curated, deduped & recaptioned open image–text dataset of 104.9M samples released under the Apache2.0 licence. https://huggingface.co/blog/jasperai/ • 4 items • Updated 16 days ago • 11

upvoted a collection 18 days ago

MiniCPM5

A SOTA 1B on-device LLM, small yet powerful. • 11 items • Updated 19 days ago • 26

upvoted an article 22 days ago

Article

Why Open Models Are the Only Sustainable Way to Teach AI

penelopegittos

•

22 days ago

• 8

upvoted a paper 22 days ago

Mega-ASR: Towards In-the-wild^2 Speech Recognition via Scaling up Real-world Acoustic Simulation

Paper • 2605.19833 • Published 26 days ago • 134

upvoted a collection 23 days ago

Hy-MT2

混元翻译模型2.0版本 • 11 items • Updated 19 days ago • 43

upvoted an article 26 days ago

Article

The Open Agent Leaderboard

ibm-research

•

26 days ago

• 13

upvoted a collection about 1 month ago

OCR models

11 items • Updated 23 days ago • 13

upvoted a paper about 1 month ago

Scaling Properties of Continuous Diffusion Spoken Language Models

Paper • 2604.24416 • Published Apr 27 • 1

upvoted an article about 1 month ago

Article

Granite 4.1 LLMs: How They’re Built

ibm-granite

•

Apr 29

• 79

upvoted an article about 2 months ago

Article

The PR you would have opened yourself

pcuenq, awni

•

Apr 16

• 72

upvoted a collection about 2 months ago

Qwen3.6

4 items • Updated Apr 22 • 401