HyperAIHyperAI

Command Palette

Search for a command to run...

5 months ago

From Data Deluge to Data Curation: A Filtering-WoRA Paradigm for Efficient Text-based Person Search

Sun Jintao ; Fei Hao ; Zheng Zhedong ; Ding Gangyi

From Data Deluge to Data Curation: A Filtering-WoRA Paradigm for
  Efficient Text-based Person Search

Abstract

In text-based person search endeavors, data generation has emerged as aprevailing practice, addressing concerns over privacy preservation and thearduous task of manual annotation. Although the number of synthesized data canbe infinite in theory, the scientific conundrum persists that how muchgenerated data optimally fuels subsequent model training. We observe that onlya subset of the data in these constructed datasets plays a decisive role.Therefore, we introduce a new Filtering-WoRA paradigm, which contains afiltering algorithm to identify this crucial data subset and WoRA (WeightedLow-Rank Adaptation) learning strategy for light fine-tuning. The filteringalgorithm is based on the cross-modality relevance to remove the lots of coarsematching synthesis pairs. As the number of data decreases, we do not need tofine-tune the entire model. Therefore, we propose a WoRA learning strategy toefficiently update a minimal portion of model parameters. WoRA streamlines thelearning process, enabling heightened efficiency in extracting knowledge fromfewer, yet potent, data instances. Extensive experimentation validates theefficacy of pretraining, where our model achieves advanced and efficientretrieval performance on challenging real-world benchmarks. Notably, on theCUHK-PEDES dataset, we have achieved a competitive mAP of 67.02% while reducingmodel training time by 19.82%.

Code Repositories

JT-Sun/Filtering-WoRA
pytorch
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
nlp-based-person-retrival-on-cuhk-pedesFiltering-WoRA(Small)
R@1: 76.38
R@10: 93.49
R@5: 89.72
mAP: 67.22
text-based-person-retrieval-on-icfg-pedesFiltering-WoRA(Small)
R@1: 68.35
R@10: 87.53
R@5: 83.10
mAP: 42.60
text-based-person-retrieval-on-rstpreid-1Filtering-WoRA(Small)
R@1: 66.85
R@10: 91.10
R@5: 86.45
mAP: 52.49

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp