Abstract Given the growing number of clinically integrated cancer single-cell transcriptomic studies, robust differential enrichment methods for gene signatures to dissect tumor cellular states for discovery and translation are critical. Current analysis strategies neither adequately represent the hierarchical structure of clinical single-cell transcriptomic datasets nor account for the variability in the number of recovered cells per sample, leading to results potentially confounded by sample-driven biology with high false positives instead of accurately representing true differential enrichment of group-level biology (e.g., treatment responders vs. non-responders). This problem is especially prominent for single-cell analyses of the tumor compartment, because high intra-patient similarity (as opposed to inter-patient similarity) results in stricter hierarchical structured data that confounds enrichment analysis. Furthermore, to identify signatures which are truly representative of the entire group, there is a need to quantify the robustness of otherwise statistically significant signatures to sample exclusion. Here, we present a new nonparametric statistical method, BEANIE, to account for these issues, and demonstrate its utility in two cancer cohorts stratified by clinical groups to reduce biological hypotheses and guide translational investigations. Using BEANIE, we show how the consideration of sample-specific versus group biology greatly decreases the false positive rate and guides identification of robust signatures that can also be corroborated across different cell type compartments.