Description
We investigate how vision-language models (VLMs) can be fine-tuned for chemistry-specific tasks by incorporating both molecular structure images and domain-specific textual descriptions. General-purpose VLMs lack precision and adaptability in the chemical domain; our study addresses this gap through efficient fine-tuning strategies, focusing on which selective layer tuning methods are most effective. Experimental evaluations on synthetic data, scored by a GPT-based assessment that assigns accuracy based on the correctness of generated responses, reveal that tuning the query (Q) and value (V) modules of the cross-attention layers yields the best performance. Our approach improves multimodal understanding in chemical contexts and is a step toward lightweight, domain-adapted VLMs that are practical for scientific research and education.
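
The selective layer tuning described above can be sketched in a few lines of PyTorch: freeze the whole model, then re-enable gradients only for the query and value projections inside cross-attention blocks. The talk does not specify a model or its module naming, so the name substrings ("cross_attn", "q_proj", "v_proj") and the learning rate below are illustrative assumptions; inspect model.named_parameters() on the actual VLM to find the real names.

import torch
import torch.nn as nn

def freeze_all_but_cross_attn_qv(model: nn.Module) -> None:
    """Freeze every parameter, then unfreeze only the Q and V
    projections of cross-attention layers (selective layer tuning)."""
    for param in model.parameters():
        param.requires_grad = False
    for name, param in model.named_parameters():
        # "cross_attn", "q_proj", "v_proj" are assumed substrings of the
        # parameter names; actual naming varies between VLM architectures.
        if "cross_attn" in name and ("q_proj" in name or "v_proj" in name):
            param.requires_grad = True

# Usage sketch: only the unfrozen projections reach the optimizer.
# freeze_all_but_cross_attn_qv(vlm)
# optimizer = torch.optim.AdamW(
#     (p for p in vlm.parameters() if p.requires_grad), lr=1e-4)

Because only a small fraction of weights stays trainable, this keeps memory and compute low, which is what makes the approach practical as a lightweight, domain-adapted fine-tune.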