Bram Vanroy
PRO
BramVanroy
246 followers · 167 following
https://bramvanroy.github.io/
BramVanroy
BramVanroy
bramvanroy
bramvanroy.bsky.social
AI & ML interests
Artificial intelligence, natural language processing, computational linguistics
Recent Activity
liked a dataset 1 day ago: GPT-NL/DuidelijkeTaal-v1.0-split
liked a dataset 4 days ago: nvidia/Nemotron-Personas-France
reacted to yuriyvnv's post with 🚀 9 days ago:

🎯 WAVe-1B-Multimodal-NL: Word-Level Speech Quality Assessment for Dutch

Following the release of the Portuguese model, we're releasing the Dutch variant of WAVe, a 1B multimodal embedding model that assesses synthetic speech quality at the word level, thereby improving the quality of synthetically augmented datasets for training ASR models. Trained on CommonVoice 16.1 Dutch with 5 corruption strategies, this model catches mispronunciations, timing errors, and prosody issues in synthetic data that sentence-level embeddings miss entirely.

Resources
- Dutch model: https://huggingface.co/yuriyvnv/WAVe-1B-Multimodal-NL
- Portuguese model: https://huggingface.co/yuriyvnv/WAVe-1B-Multimodal-PT
- Code: https://github.com/yuriyvnv/WAVe

This model builds on CommonVoice Dutch data. Thanks to @mozilla and the CommonVoice community for making multilingual speech data accessible.

Would be great to hear from the Dutch NLP community (@BramVanroy, @GroNLP), especially if you're working on Dutch ASR or TTS pipelines where quality filtering could help. Also tagging @hf-audio as this sits at the intersection of speech processing and data curation.
BramVanroy's datasets (45), sorted by recently updated
BramVanroy/finewiki-nl-30-to-24k-tokens • Viewer • Updated Dec 18, 2025 • 821k • 53 • 1
BramVanroy/finemath-4plus-seqlen36k • Viewer • Updated Dec 5, 2025 • 2.85M • 66 • 1
BramVanroy/synthetic-uner-ner-200-Qwen3-14B-AWQ • Viewer • Updated Nov 26, 2025 • 200 • 35
BramVanroy/synthetic-uner-ner • Viewer • Updated Nov 25, 2025 • 61.5k • 83
BramVanroy/synthetic-uner-ner-20000-Qwen3-14B-AWQ • Viewer • Updated Nov 24, 2025 • 20k • 43
BramVanroy/universal_ner • Viewer • Updated Nov 24, 2025 • 77.9k • 337
BramVanroy/conll2002 • Viewer • Updated Nov 14, 2025 • 35.7k • 59
BramVanroy/conll2003 • Viewer • Updated Nov 14, 2025 • 20.7k • 614 • 1
BramVanroy/dutch-edu-classifier-training-v3 • Viewer • Updated Sep 4, 2025 • 274k • 24
BramVanroy/CommonCrawl-CreativeCommons • Viewer • Updated Aug 28, 2025 • 739M • 434 • 34
BramVanroy/CommonCrawl-CreativeCommons-fine • Viewer • Updated Aug 28, 2025 • 75.1M • 378 • 2
BramVanroy/CommonCrawl-CreativeCommons-strict • Viewer • Updated Aug 28, 2025 • 32.8M • 95 • 1
BramVanroy/dutch-edu-classifier-training-v2 • Viewer • Updated Aug 18, 2025 • 500k • 33
BramVanroy/dutch-edu-classifier-training • Viewer • Updated Aug 14, 2025 • 744k • 20
BramVanroy/fineweb-duckdbs • Updated May 15, 2025 • 1.63k • 1
BramVanroy/fineweb-2-duckdbs • Updated Apr 28, 2025 • 4.57k
BramVanroy/fw2-nl-qwen2_5-72b-50k-merged-split • Viewer • Updated Apr 25, 2025 • 95.7k • 5 • 1
BramVanroy/belebele_dutch • Viewer • Updated Apr 25, 2025 • 1.8k • 18
BramVanroy/finewebs-copyright-domains • Viewer • Updated Mar 26, 2025 • 361 • 3 • 1
BramVanroy/WildChat-1M-filtered-gpt-4 • Viewer • Updated Feb 17, 2025 • 136k • 11
BramVanroy/fw2-nl-rm-qwen2_5-72b-50k • Viewer • Updated Jan 20, 2025 • 50k • 5
BramVanroy/fw2-nl-qwen2_5-72b-50k • Viewer • Updated Jan 20, 2025 • 50k • 8
BramVanroy/wikipedia_culturax_dutch • Viewer • Updated Dec 23, 2024 • 1.3B • 4.14k • 6
BramVanroy/ultra_feedback_dutch • Viewer • Updated Dec 6, 2024 • 53.6k • 117 • 3
BramVanroy/no_robots_dutch • Viewer • Updated Dec 6, 2024 • 8.61k • 64 • 2
BramVanroy/ultra_feedback_dutch_cleaned • Viewer • Updated Dec 6, 2024 • 183k • 149 • 6
BramVanroy/orca_dpo_pairs_dutch_cleaned • Viewer • Updated Dec 6, 2024 • 31.6k • 80 • 3
BramVanroy/orca_dpo_pairs_dutch • Viewer • Updated Dec 6, 2024 • 11k • 40 • 6
BramVanroy/ultrachat_200k_dutch • Viewer • Updated Dec 6, 2024 • 214k • 113 • 8
BramVanroy/lmsys-20240814-nl • Viewer • Updated Oct 21, 2024 • 2.75k • 8