Current advances in sequencing technology have greatly increased the availability of sequence data from public genetic databases. With data from GenBank, we assemble and phylogenetically investigate a 19,740-taxon, five-locus supermatrix (i.e., atpB, rbcL, matK, matR , and ITS) for rosids, a large clade containing over 90,000 species, or approximately a quarter of all angiosperms (assuming an estimate of 400,000 angiosperm species). The topology and divergence times of the five-locus tree generally agree with previous estimates of rosid phylogeny, and we recover greater resolution and support in several areas along the rosid backbone, but with a few significant differences (e.g., the placement of the COM clade, as well as Myrtales, Vitales, and Zygophyllales). Our five-locus phylogeny is the most comprehensive DNA data set yet compiled for the rosid clade. Yet, even with 19,740 species, current sampling represents only 16-22% of all rosids, and we also find evidence of strong phylogenetic bias in the accumulation of GenBank data, highlighting continued challenges for species coverage. These limitations also exist in other major angiosperm clades (e.g., asterids, monocots) as well as other large, understudied branches of the Tree of Life, highlighting the need for broader molecular sampling. Nevertheless, the phylogeny presented here improves upon sampling by more than two-fold and will be an important resource for macroevolutionary studies of this pivotal clade.