Mixture-of-Subspaces in Low-Rank Adaptation

Taiqiang Wu, Jiahao Wang, Zhe Zhao, Ngai Wong

Abstract

In this paper, we introduce a subspace-inspired Low-Rank Adaptation (LoRA) method, which is computationally efficient, easy to implement, and readily applicable to large language, multimodal, and diffusion models. Initially, we equivalently decompose the weights of LoRA into two subspaces, and find that simply mixing them can enhance performance. To study such a phenomenon, we revisit it through a fine-grained subspace lens, showing that such modification is equivalent to employing a fixed mixer to fuse the subspaces. To be more flexible, we jointly learn the mixer with the original LoRA weights, and term the method Mixture-of-Subspaces LoRA (MoSLoRA). MoSLoRA consistently outperforms LoRA on tasks in different modalities, including commonsense reasoning, visual instruction tuning, and subject-driven text-to-image generation, demonstrating its effectiveness and robustness. Code is available at https://github.com/wutaiqiang/MoSLoRA.
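
The equivalence described in the abstract can be illustrated with a small, dependency-free sketch. It is not the authors' implementation: the factor names `A` and `B`, their shapes (`d_in x r` and `r x d_out`), the split into two equal halves, and all helper names are assumptions made for illustration. Under those assumptions, plain LoRA corresponds to an identity mixer, and mixing the two halves corresponds to a fixed mixer made of identity blocks. In MoSLoRA the mixer would instead be a trainable `r x r` matrix learned jointly with `A` and `B`.

```python
import random

def matmul(X, Y):
    """Multiply two matrices stored as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*Y)] for row in X]

def identity(n):
    """n x n identity matrix."""
    return [[1.0 if i == j else 0.0 for j in range(n)] for i in range(n)]

def lora_update(A, B):
    """Plain LoRA update: A @ B, with A (d_in x r) and B (r x d_out)."""
    return matmul(A, B)

def mixed_update(A, mixer, B):
    """Update with an r x r mixer between the factors: A @ mixer @ B."""
    return matmul(matmul(A, mixer), B)

def two_subspace_mix(A, B):
    """(A1 + A2) @ (B1 + B2): A is split by columns, B by rows (assumed split)."""
    h = len(B) // 2
    A_sum = [[row[j] + row[j + h] for j in range(h)] for row in A]
    B_sum = [[B[i][k] + B[i + h][k] for k in range(len(B[0]))] for i in range(h)]
    return matmul(A_sum, B_sum)

def block_identity_mixer(r):
    """Fixed mixer [[I, I], [I, I]] built from (r/2 x r/2) identity blocks."""
    h = r // 2
    return [[1.0 if i % h == j % h else 0.0 for j in range(r)] for i in range(r)]

# Small integer-valued factors keep the floating-point arithmetic exact.
random.seed(0)
d_in, r, d_out = 3, 4, 5  # hypothetical sizes; r must be even for the split
A = [[float(random.randint(-3, 3)) for _ in range(r)] for _ in range(d_in)]
B = [[float(random.randint(-3, 3)) for _ in range(d_out)] for _ in range(r)]

# Plain LoRA is the special case of an identity mixer.
assert lora_update(A, B) == mixed_update(A, identity(r), B)

# Mixing the two halves equals using the fixed block-identity mixer.
assert two_subspace_mix(A, B) == mixed_update(A, block_identity_mixer(r), B)
```

Because the mixer is only `r x r`, it adds `r * r` parameters per adapted weight, which is small next to the `r * (d_in + d_out)` parameters of the two LoRA factors when `r` is much smaller than the layer dimensions.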

Code Repositories

wutaiqiang/moslora (official, PyTorch)

Benchmarks

Benchmark                                 Methodology                            Metrics
----------------------------------------  -------------------------------------  -------------------
common-sense-reasoning-on-arc-challenge   LLaMA 3 8B + MoSLoRA (fine-tuned)      Accuracy: 81.5
common-sense-reasoning-on-arc-easy        LLaMA 3 8B+MoSLoRA (fine-tuned)        Accuracy: 90.5
common-sense-reasoning-on-winogrande      LLaMA3 8B+MoSLoRA                      Accuracy: 85.8
question-answering-on-boolq               LLaMA3+MoSLoRA                         Accuracy: 74.6
question-answering-on-openbookqa          LLaMA-3 8B+MoSLoRA                     Accuracy: 86.8
question-answering-on-piqa                LLaMA3 8B+MoSLoRA                      Accuracy: 89.7
question-answering-on-social-iqa          LLaMA-3 8B+MoSLoRA (fine-tuned)        Accuracy: 81.0
visual-question-answering-on-mm-vet       InternLM2+ViT (QMoSLoRA)               GPT-4 score: 35.2
visual-question-answering-on-mm-vet       LLaVA-InternLM2-7B-ViT + MoSLoRA       GPT-4 score: 35.2
visual-question-answering-on-mmbench      LLaVA-InternLM2-ViT + MoSLoRA          GPT-3.5 score: 73.8
visual-question-answering-on-mmbench      LLaVA-LLaMA3-8B-ViT + MoSLoRA          GPT-3.5 score: 73.0
