ResearchHub | Open Science Community

XX

Xiaofei Xie

Author with expertise in Empirical Studies in Software Engineering

Achievements

Cited Author

Key Stats

Upvotes received:

0

Publications:

15

(40% Open Access)

Cited by:

551

h-index:

33

/

i10-index:

90

Reputation

Biology

< 1%

Chemistry

< 1%

Economics

< 1%

Show more

How is this calculated?

Publications

DeepHunter: a coverage-guided fuzz testing framework for deep neural networks

Xiaofei Xie et al.Jul 10, 2019

The past decade has seen the great potential of applying deep neural network (DNN) based software to safety-critical scenarios, such as autonomous driving. Similar to traditional software, DNNs could exhibit incorrect behaviors, caused by hidden defects, leading to severe accidents and losses. In this paper, we propose DeepHunter, a coverage-guided fuzz testing framework for detecting potential defects of general-purpose DNNs. To this end, we first propose a metamorphic mutation strategy to generate new semantically preserved tests, and leverage multiple extensible coverage criteria as feedback to guide the test generation. We further propose a seed selection strategy that combines both diversity-based and recency-based seed selection. We implement and incorporate 5 existing testing criteria and 4 seed selection strategies in DeepHunter. Large-scale experiments demonstrate that (1) our metamorphic mutation strategy is useful to generate new valid tests with the same semantics as the original seed, by up to a 98% validity ratio; (2) the diversity-based seed selection generally weighs more than recency-based seed selection in boosting the coverage and in detecting defects; (3) DeepHunter outperforms the state of the arts by coverage as well as the quantity and diversity of defects identified; (4) guided by corner-region based criteria, DeepHunter is useful to capture defects during the DNN quantization for platform migration.

Artificial Intelligence

0

Paper

Artificial Intelligence

Save

Hawkeye

Hongxu Chen et al.Oct 15, 2018

Grey-box fuzzing is a practically effective approach to test real-world programs. However, most existing grey-box fuzzers lack directedness, i.e. the capability of executing towards user-specified target sites in the program. To emphasize existing challenges in directed fuzzing, we propose Hawkeye to feature four desired properties of directed grey-box fuzzers. Owing to a novel static analysis on the program under test and the target sites, Hawkeye precisely collects the information such as the call graph, function and basic block level distances to the targets. During fuzzing, Hawkeye evaluates exercised seeds based on both static information and the execution traces to generate the dynamic metrics, which are then used for seed prioritization, power scheduling and adaptive mutating. These strategies help Hawkeye to achieve better directedness and gravitate towards the target sites. We implemented Hawkeye as a fuzzing framework and evaluated it on various real-world programs under different scenarios. The experimental results showed that Hawkeye can reach the target sites and reproduce the crashes much faster than state-of-the-art grey-box fuzzers such as AFL and AFLGo. Specially, Hawkeye can reduce the time to exposure for certain vulnerabilities from about 3.5 hours to 0.5 hour. By now, Hawkeye has detected more than 41 previously unknown crashes in projects such as Oniguruma, MJS with the target sites provided by vulnerability prediction tools; all these crashes are confirmed and 15 of them have been assigned CVE IDs.

Information Systems

0

Paper

Save

Is Aggregation the Only Choice? Federated Learning via Layer-wise Model Recombination

Ming Hu et al.Aug 24, 2024

Although Federated Learning (FL) enables global model training across clients without compromising their raw data, due to the unevenly distributed data among clients, existing Federated Averaging (FedAvg)-based methods suffer from the problem of low inference performance. Specifically, different data distributions among clients lead to various optimization directions of local models. Aggregating local models usually results in a low-generalized global model, which performs worse on most of the clients. To address the above issue, inspired by the observation from a geometric perspective that a well-generalized solution is located in a flat area rather than a sharp area, we propose a novel and heuristic FL paradigm named FedMR (Federated Model Recombination). The goal of FedMR is to guide the recombined models to be trained towards a flat area. Unlike conventional FedAvg-based methods, in FedMR, the cloud server recombines collected local models by shuffling each layer of them to generate multiple recombined models for local training on clients rather than an aggregated global model. Since the area of the flat area is larger than the sharp area, when local models are located in different areas, recombined models have a higher probability of locating in a flat area. When all recombined models are located in the same flat area, they are optimized towards the same direction. We theoretically analyze the convergence of model recombination. Experimental results show that, compared with state-of-the-art FL methods, FedMR can significantly improve the inference accuracy without exposing the privacy of each client.

Artificial Intelligence

0

Paper

Artificial Intelligence

Save

FlexFL: Heterogeneous Federated Learning via APoZ-Guided Flexible Pruning in Uncertain Scenarios

Z. Chen et al.Nov 1, 2024

Artificial Intelligence

Sociology And Political Science

0

Paper

Artificial Intelligence

Sociology And Political Science

Save

Neuron Sensitivity Guided Test Case Selection

Dong Huang et al.Jun 12, 2024

Deep Neural Networks (DNNs) have been widely deployed in software to address various tasks (e.g., autonomous driving, medical diagnosis). However, they can also produce incorrect behaviors that result in financial losses and even threaten human safety. To reveal and repair incorrect behaviors in DNNs, developers often collect rich, unlabeled datasets from the natural world and label them to test DNN models. However, properly labeling a large number of datasets is a highly expensive and time-consuming task. To address the above-mentioned problem, we propose NSS, Neuron Sensitivity Guided Test Case Selection, which can reduce the labeling time by selecting valuable test cases from unlabeled datasets. NSS leverages the information of the internal neuron induced by the test cases to select valuable test cases, which have high confidence in causing the model to behave incorrectly. We evaluated NSS with four widely used datasets and four well-designed DNN models compared to the state-of-the-art (SOTA) baseline methods. The results show that NSS performs well in assessing the probability of failure triggering in test cases and in the improvement capabilities of the model. Specifically, compared to the baseline approaches, NSS achieves a higher fault detection rate (e.g., when selecting 5% of the test cases from the unlabeled dataset in the MNIST&LeNet1 experiment, NSS can obtain an 81.8% fault detection rate, which is a 20% increase compared with SOTA baseline strategies).

Artificial Intelligence

0

Paper

Artificial Intelligence

Save

CaBaFL: Asynchronous Federated Learning via Hierarchical Cache and Feature Balance

Zeke Xia et al.Nov 1, 2024

Artificial Intelligence

0

Paper

Artificial Intelligence

Save

Ratchet: Retrieval Augmented Transformer for Program Repair

Li Wang et al.Oct 28, 2024

Artificial Intelligence

Computer Networks And Communications

0

Paper

Artificial Intelligence

Computer Networks And Communications

Save

Themis: Automatic and Efficient Deep Learning System Testing with Strong Fault Detection Capability

Dong Huang et al.Oct 28, 2024

Artificial Intelligence

0

Paper

Artificial Intelligence

Save

Klotho improves Der p1-induced bronchial epithelial cell damage by inhibiting endoplasmic reticulum stress to regulate mitochondrial function

Chunguang Wang et al.Nov 30, 2024

0

Paper

Save

: Towards Fine-Grained Unknown Class Detection Against the Open-Set Attack Spectrum With Variable Legitimate Traffic

Ziming Zhao et al.Jan 1, 2024

Artificial Intelligence

Computational Mechanics

0

Paper

Artificial Intelligence

Computational Mechanics

Save

Load More

Scan to connect with one of our mobile apps

Coinbase Wallet app

Coinbase Wallet app

Connect with your self-custody wallet

Coinbase app

Coinbase app

Connect with your Coinbase account

QR Code

Open Coinbase Wallet app
Tap Scan

Or try the Coinbase Wallet browser extension

Connect with dapps with just one click on your desktop browser
Add an additional layer of security by using a supported Ledger hardware wallet