ResearchHub | Open Science Community

Weakly Supervised Object Localization with Progressive Domain Adaptation

Dong Li et al.Jun 1, 2016

We address the problem of weakly supervised object localization where only image-level annotations are available for training. Many existing approaches tackle this problem through object proposal mining. However, a substantial amount of noise in object proposals causes ambiguities for learning discriminative object models. Such approaches are sensitive to model initialization and often converge to an undesirable local minimum. In this paper, we address this problem by progressive domain adaptation with two main steps: classification adaptation and detection adaptation. In classification adaptation, we transfer a pre-trained network to our multi-label classification task for recognizing the presence of a certain object in an image. In detection adaptation, we first use a mask-out strategy to collect class-specific object proposals and apply multiple instance learning to mine confident candidates. We then use these selected object proposals to fine-tune all the layers, resulting in a fully adapted detection network. We extensively evaluate the localization performance on the PASCAL VOC and ILSVRC datasets and demonstrate significant performance improvement over the state-of-the-art methods.

Artificial Intelligence

Computer Vision And Pattern Recognition

0

Paper

Artificial Intelligence

227

0

Save

0

A survey of recent advances in visual feature detection

Yali Li et al.Aug 11, 2014

Philosophy

Artificial Intelligence

0

Paper

Save

Linkage Based Face Clustering via Graph Convolution Network

Zhongdao Wang et al.Jun 1, 2019

In this paper, we present an accurate and scalable approach to the face clustering task. We aim at grouping a set of faces by their potential identities. We formulate this task as a link prediction problem: a link exists between two faces if they are of the same identity. The key idea is that we find the local context in the feature space around an instance (face) contains rich information about the linkage relationship between this instance and its neighbors. By constructing sub-graphs around each instance as input data, which depict the local context, we utilize the graph convolution network (GCN) to perform reasoning and infer the likelihood of linkage between pairs in the sub-graphs. Experiments show that our method is more robust to the complex distribution of faces than conventional methods, yielding favorably comparable results to state-of-the-art methods on standard face clustering benchmarks, and is scalable to large datasets. Furthermore, we show that the proposed method does not need the number of clusters as prior, is aware of noises and outliers, and can be extended to a multi-view version for more accurate clustering accuracy.

Artificial Intelligence

Paleontology

0

Paper

Artificial Intelligence

187

0

Save

0

Self Attention Re-encoding and Linguistic Ability Preserving for Context-Aware Video Captioning

Mengqiang Xiao et al.Apr 19, 2024

History

Philosophy

0

Paper

Save

UPL-Net: Uncertainty-aware prompt learning network for semi-supervised action recognition

Shu Yang et al.Dec 1, 2024

Artificial Intelligence

Biomedical Engineering

0

Paper

Artificial Intelligence

Biomedical Engineering

0

Save

0

Factors Influencing Chinese University English Majors' Ability to Analyze 19th Century Romantic Poetry

Yali Li et al.Nov 16, 2024

This thesis explores the factors influencing the ability of Chinese university English majors to analyze 19th-century Romantic poetry. As Romantic poetry is characterized by its complex language, rich symbolism, and deep emotional content, it presents unique challenges for students. This study aims to identify and understand the key factors—motivation, interest, exposure to Romantic poetry, cultural knowledge, and Teaching Methods—that impact students' analytical abilities. The research adopts a mixed-methods approach, combining quantitative surveys with qualitative interviews to gather comprehensive data from students and educators across several Chinese universities. The study also examines the moderating role of critical thinking skills in enhancing students' ability to engage with and interpret Romantic poetry. Findings from this research reveal that motivation and interest are strongly impacted with improved analytical skills, with students who are more engaged showing a higher proficiency in literary analysis. Exposure to Romantic poetry, both in and out of the classroom, was found to significantly impact students' familiarity with the genre, which in turn enhanced their analytical abilities. Cultural knowledge, particularly an understanding of the historical and philosophical context of the Romantic period, was also identified as a crucial factor in the depth and accuracy of students' analyses. Moreover, the study highlights the importance of innovative Teaching Methods, such as interactive and multimedia approaches, in facilitating a deeper engagement with Romantic poetry. Traditional lecture-based methods, while useful, were found to be less effective in addressing the unique challenges posed by this literary genre. The research concludes with recommendations for educators, curriculum designers, and policymakers to develop strategies that can better support the development of analytical skills in students studying Romantic poetry. The study also suggests areas for future research, particularly in exploring the applicability of these findings across other literary genres and educational contexts.

Information Systems

Psychology

0

Paper

Save

Research on the effect of processing methods on ginseng quality and key technologies for improvement

Yafei Qiao et al.Jun 19, 2024

As an important medicinal plant, ginseng has various pharmacological activities such as antioxidant, anti-tumor, and anti-inflammatory. The magnitude of these pharmacological activities is closely related to ginseng quality, so to improve the pharmacological activities of ginseng, it is first necessary to improve its quality. In addition to ginseng variety, cultivation, origin, harvest time, and other factors, the impact of processing on ginseng quality is also crucial. This article reviews the different methods of ginseng processing and their impact on ginseng quality, and proposes some strategies to improve ginseng quality through processing methods, aiming to provide technological support for the high-quality development of the ginseng industry.

Philosophy

Molecular Biology

0

Paper

Save

Spectral convolutional neural network chip for in-sensor edge computing of incoherent natural light

Kaiyu Cui et al.Jan 2, 2025

Optical neural networks are considered next-generation physical implementations of artificial neural networks, but their capabilities are limited by on-chip integration scale and requirement for coherent light sources. This study proposes a spectral convolutional neural network (SCNN) with matter meta-imaging. The optical convolutional layer is implemented by integrating very large-scale and pixel-aligned spectral filters on CMOS image sensor. It facilitates highly parallel spectral vector-inner products of incident incoherent natural light i.e., the direct information carrier, which empowers in-sensor optical analog computing at extremely high energy efficiency. To the best of our knowledge, this is the first integrated optical computing utilizing natural light. We employ the same SCNN chip for completely different real-world complex tasks and achieve accuracies of over 96% for pathological diagnosis and almost 100% for face anti-spoofing at video rates. These results indicate a feasible and scalable in-sensor edge computing chip of natural light for various portable terminals. Optical neural networks have been successfully realized on-chip level, yet to operate it requires coherent light sources. Here, the authors proposed integrated computing based on natural light by combining large-scale spectral filters on CMOS image sensor to facilitate vector-inner products from incoherent light.

Artificial Intelligence

Paleontology

0

Paper

Artificial Intelligence

Paleontology

0

Save

0

Divergent Synthesis of 17‐nor‐Cephalotane Diterpenoids through Developed Ynol‐diene Cyclization

Zezhong Sun et al.Jul 8, 2024

On the basis of a novel ynol‐diene cyclization developed as a rapid access to tropone unit, the first divergent strategy to 17‐nor‐cephalotane diterpenoids has been successfully established. Combining with a bioinspired stereoselective dual hydrogenation, the divergent total synthesis of (+)‐3‐deoxyfortalpinoid F, (+)‐harringtonolide, (−)‐fortalpinoids M/N/P, and analog (−)‐20‐deoxocephinoid P have been achieved in 14‐17 linear longest steps starting from commercially available materials.

Molecular Biology

Organic Chemistry

0

Paper

Save

UniDetector: Towards Universal Object Detection with Heterogeneous Supervision

Zhenyu Wang et al.Jan 1, 2024

In this paper, we formally address universal object detection, which aims to detect every category in every scene. The dependence on human annotations, the limited visual information, and the novel categories in open world severely restrict the universality of detectors. We propose UniDetector, a universal object detector that recognizes enormous categories in the open world. The critical points for UniDetector are: 1) it leverages images of multiple sources and heterogeneous label spaces in training through image-text alignment, which guarantees sufficient information for universal representations. 2) it involves heterogeneous supervision training, which alleviates the dependence on the limited fully-labeled images. 3) it generalizes to open world easily while keeping the balance between seen and unseen classes. 4) it further promotes generalizing to novel categories through our proposed decoupling training manner and probability calibration. These contributions allow UniDetector to detect over 7k categories, the largest measurable size so far, with only about 500 classes participating in training. Our UniDetector behaves the strong zero-shot ability on large-vocabulary datasets - it surpasses supervised baselines by more than 5% without seeing any corresponding images. On 13 detection datasets with various scenes, UniDetector also achieves state-of-the-art performance with only a 3% amount of training data.

Artificial Intelligence

Computer Vision And Pattern Recognition

0

Paper

Artificial Intelligence

Computer Vision And Pattern Recognition

0

Save