ijcrms logo

A Unified Multi-Modal Mixture-of-Experts Model for Integrated Representation Learning in Pharmaceutical Sciences

Authors
  • Anushree Bhople

    Author

Keywords:
Multi-Modal Learning, Mixture-Of-Experts, Pharmaceutical Sciences, Representation Learning, Large Language Models, Bioinformatics
Abstract

The rapid advancement of large language models (LLMs) has opened up opportunities for AI applications in pharmaceutical sciences. However, integrating diverse biological data modalities continues to remain challenging.  We propose SciMind, a multi-modal mixture-of-experts (MoE) having the capability of integrated representation learning from pharmaceutical data sources, including biomedical text, DNA sequence, protein sequence, and molecular structure. The proposed method includes method-specific tokenization strategies, sparse expert routing mechanisms, and cross-modal pre-training for improved knowledge transfer across multiple biological representations. An expert initialization approach based on limited K-means and adaptive top-k routing can use the parameters effectively while preserving domain-specific knowledge. Through experimental evaluations across four applications, including biomedical natural language processing, molecular understanding, promoter prediction and protein-related tasks, SciMind achieves competitive performance against existing domain-specific and general-purpose models.  According to the results, unified multi-modal learning can assist in the representation quality, drug reasoning capabilities, and applications in drug acceptance, molecule analysis, and personalized medicine.

References
Cover Image
Downloads
Published
2026-06-13
Section
Articles
License

Copyright (c) 2026 International Journal of Clinical Research and Medical Sciences

Creative Commons License

This work is licensed under a Creative Commons Attribution 4.0 International License.