ResearchHub | Open Science Community

HD

Haixing Dai

Author with expertise in Autofocusing in Microscopy and Photography

Achievements

This user has not unlocked any achievements yet.

Key Stats

Upvotes received:

0

Publications:

4

(25% Open Access)

Cited by:

0

h-index:

12

/

i10-index:

15

Reputation

Biology

< 1%

Chemistry

< 1%

Economics

< 1%

Show more

How is this calculated?

Publications

Mask-Guided Vision Transformer for Few-Shot Learning

Yuzhong Chen et al.Jan 1, 2024

Learning with little data is challenging but often inevitable in various application scenarios where the labeled data are limited and costly. Recently, few-shot learning (FSL) gained increasing attention because of its generalizability of prior knowledge to new tasks that contain only a few samples. However, for data-intensive models such as vision transformer (ViT), current fine-tuning-based FSL approaches are inefficient in knowledge generalization and, thus, degenerate the downstream task performances. In this article, we propose a novel mask-guided ViT (MG-ViT) to achieve an effective and efficient FSL on the ViT model. The key idea is to apply a mask on image patches to screen out the task-irrelevant ones and to guide the ViT focusing on task-relevant and discriminative patches during FSL. Particularly, MG-ViT only introduces an additional mask operation and a residual connection, enabling the inheritance of parameters from pretrained ViT without any other cost. To optimally select representative few-shot samples, we also include an active learning-based sample selection method to further improve the generalizability of MG-ViT-based FSL. We evaluate the proposed MG-ViT on classification, object detection, and segmentation tasks using gradient-weighted class activation mapping (Grad-CAM) to generate masks. The experimental results show that the MG-ViT model significantly improves the performance and efficiency compared with general fine-tuning-based ViT and ResNet models, providing novel insights and a concrete approach toward generalizing data-intensive and large-scale deep learning models for FSL.

Artificial Intelligence

Media Technology

0

Paper

Artificial Intelligence

Media Technology

Save

BI-AVAN: A Brain-Inspired Adversarial Visual Attention Network for Characterizing Human Visual Attention from Neural Activity

Heng Huang et al.Jan 1, 2024

Artificial Intelligence

Cognitive Neuroscience

0

Paper

Artificial Intelligence

Cognitive Neuroscience

Save

Exploring New Frontiers in Agricultural NLP: Investigating the Potential of Large Language Models for Food Applications

Saed Rezayi et al.Jan 1, 2024

Artificial Intelligence

0

Paper

Artificial Intelligence

Save

Artificial General Intelligence for Medical Imaging Analysis

Xiang Li et al.Nov 7, 2024

Artificial Intelligence

0

Paper

Artificial Intelligence

Save