ResearchHub | Open Science Community

JZ

Jing Zhang

Author with expertise in Visual Object Tracking and Person Re-identification

Achievements

Cited Author

Key Stats

Upvotes received:

0

Publications:

6

(17% Open Access)

Cited by:

185

h-index:

21

/

i10-index:

35

Reputation

Biology

< 1%

Chemistry

< 1%

Economics

< 1%

Show more

How is this calculated?

Publications

Small Object Detection in Unmanned Aerial Vehicle Images Using Feature Fusion and Scaling-Based Single Shot Detector With Spatial Context Analysis

Xi Liang et al.Mar 20, 2019

Objects in unmanned aerial vehicle (UAV) images are generally small due to the high-photography altitude. Although many efforts have been made in object detection, how to accurately and quickly detect small objects is still one of the remaining open challenges. In this paper, we propose a feature fusion and scaling-based single shot detector (FS-SSD) for small object detection in the UAV images. The FS-SSD is an enhancement based on FSSD, a variety of the original single shot multibox detector (SSD). We add an extra scaling branch of the deconvolution module with an average pooling operation to form a feature pyramid. The original feature fusion branch is adjusted to be better suited to the small object detection task. The two feature pyramids generated by the deconvolution module and feature fusion module are utilized to make predictions together. In addition to the deep features learned by the FS-SSD, to further improve the detection accuracy, spatial context analysis is proposed to incorporate the object spatial relationships into object redetection. The interclass and intraclass distances between different object instances are computed as a spatial context, which proves effective for multiclass small object detection. Six experiments are conducted on the PASCAL VOC dataset and the two UAV image datasets. The experimental results demonstrate that the proposed method can achieve a comparable detection speed but an accuracy superior to those of the six state-of-the-art methods.

Artificial Intelligence

0

Paper

Save

Domain adaptation with optimized feature distribution for streamer action recognition in live video

Chen He et al.Jun 4, 2024

Artificial Intelligence

0

Paper

Save

Gp3Former: Gaussian Prior Tri-Cascaded Transformer for Video Instance Segmentation in Livestreaming Scenarios

Wensheng Li et al.Jan 1, 2024

Artificial Intelligence

Electrical And Electronic Engineering

0

Paper

Artificial Intelligence

Electrical And Electronic Engineering

Save

Predicting Postoperative Complications of TAVR Using Multimodal Feature Alignment Network

Xiaoguang Li et al.Jan 19, 2024

This study proposes a multimodal feature alignment network for early prediction of new-onset conduction disturbance after the Transcatheter Aortic Valve Replacement (TAVR) surgery. Based on medical prior knowledge, the baseline clinical information, calcification in the aortic root complex region, radionics features, and the CT slices , are used as multimodal data, and feature selection methods are used to select features that are highly correlated with the disease. Then, by using multimodal feature alignment and cross attention mechanisms between patterns, the data from multiple patterns is fully fused together to achieve information complementarity and prediction completion. Specifically, cosine similarity is employed to measure the similarity between image features and composite features in the feature space and perform feature alignment. Then, a cross-attention mechanism is employed to enhance the interdependence between composite features and image features, improve the predictive performance and interpretability of the model, and effectively integrate multimodal features. Experimental results show that our method achieved 90.48%, 84.21%, 94.12%, and 88.00% in accuracy, precision, recall, and specificity, respectively. As far as we know, we have not found similar reports on this topic. The proposed method provides a novel approach for predicting complications after TAVR, which is with great significance in clinical practice of the cardiac surgery.

Artificial Intelligence

0

Paper

Artificial Intelligence

Save

Intracranial Large Vessel Severe Stenosis and Occlusion Detection Based on Vision Transformer Fusion Model from Multi-parametric MRI

Mengzhou Sun et al.Nov 26, 2024

Motivation: Traditionally, large vessel severe stenosis and occlusion (LVSSO) detection based on CTA needs contrast agent exposure. It is important to develop a LVSSO detection approach using contrast agent-free MR images that can achieve results comparable to clinical doctors. Goal(s): To develop a new fusion algorithm that can achieve the accuracy comparable to clinical diagnostic levels. Approach: A new fusion algorithm model based on vision transformer was developed. A total of 380 patients were enrolled in the current study. Results: The proposed model achieved an AUC of 0.963 and an accuracy of 94.7%. Impact: The proposed model achieved satisfactory accuracy for LVSSO detection, i.e. 94.7%, indicating that the performance of the proposed model has reached the clinical diagnosis level.

Artificial Intelligence

0

Paper

Artificial Intelligence

Save

Spatial-temporal transformer network for protecting person-of-interest from deepfaking

Dingyu Lu et al.Jan 8, 2025

Artificial Intelligence

Computer Vision And Pattern Recognition

0

Paper

Artificial Intelligence

Computer Vision And Pattern Recognition

Save