(1/10) Excited to announce our latest work!
@arpita-s.bsky.social, @amanpatel100.bsky.social, and I will be presenting DART-Eval, a rigorous suite of evals for DNA Language Models on transcriptional regulatory DNA, at #NeurIPS2024. Check it out!
arxiv.org/abs/2412.05430
DART-Eval: A Comprehensive DNA Language Model Evaluation Benchmark on Regulatory DNA
(2/10) DNALMs are a new class of self-supervised models for DNA, inspired by the success of LLMs. They are typically pre-trained solely on genomic DNA, without any external annotations.
(3/10) However, DNA is vastly different from text, being much more heterogeneous, imbalanced, and sparse. Imagine a blend of several different languages interspersed with a load of gibberish.
(4/10) An effective DNALM should:
• Learn representations that can accurately distinguish different types of functional DNA elements
• Serve as a foundation for downstream supervised models (see the toy probing sketch after this list)
• Outperform models trained from scratch
(5/10) Rigorous evaluations of DNALMs, though critical, are lacking. Existing benchmarks:
• Focus on surrogate tasks tenuously related to practical use cases
• Suffer from inadequate controls and other dataset design flaws
• Compare against outdated or inappropriate baselines
(6/10) We introduce DART-Eval, a suite of five biologically informed DNALM evaluations focused on transcriptional regulatory DNA, ordered by increasing difficulty.
(7/10) DNALMs struggle with more difficult tasks.
Furthermore, small models trained from scratch (<10M params) routinely outperform much larger DNALMs (>1B params), even after LoRA fine-tuning!
Our results on the hardest task: counterfactual variant effect prediction.
Really appreciate your paper. Could you clarify whether ChromBPNet and the DNALMs are trained on the same data? From the paper and code, it looks like only ChromBPNet is given profile-level labels.
Thank you for the kind words! Yes, we use unmodified ChromBPNet models, which are trained with profile-level data and a bias model. However, these evaluations use only the count head.
Thanks! Do you think this could explain part of the gap between tasks 4 and 5? Could profile prediction help the count head generalize to variants?
I think that’ll be interesting to look more into! The profile information does not convey overall accessibility since it’s normalized, but maybe this sort of multitasking could help.