I am Vo Minh Duc, a Senior Research Scientist at SB Intuitions, Japan, working on foundation models and multimodal generation. Feel free to reach out if you are interested in collaborating!
Annual grand challenge on document understanding with Vision-Language Models. 2026 edition extends to multilingual PDFs and evidence-grounded answering.
A high-signal AI paper reading community for engineers and researchers in Vietnam & Japan — deep-diving into papers that matter every two weeks.
1 paper accepted to NeurIPS 2024.
Attending ACL 2024 in Bangkok, Thailand.
Attending MIRU 2024 in Kumamoto, Japan.
1 paper accepted to ECCV 2024.
We will organize a workshop “Large Vision – Language Model Learning and Applications” at ACCV 2024.
Attending CVPR 2024 in Seattle, US.
Award: First Workshop on Test-Time Adaptation: Model, Adapt Thyself! (MAT), Community Track, CVPR 2024.
2 CVPR workshop paper, 1 main conference paper.
Award: 委員特別賞. 言語処理学会第30回年次大会(NLP2024).
1 paper accepted to 2024 IEEE CVPR workshop on fair, data-efficient, and trusted computer vision.
1 paper accepted to CVPR 2024.
JSPS Grant-in-Aid for Early-Career Scientists (KAKENHI) FY 2024 - 2025.
The National Institute of Informatics (Japan) collaborative research fund FY 2024.