Abstract Across the tree of life, gene lengths vary, but most are no more than a few thousand base pairs in length. The largest protein often reported is the ∼40,000 aa eukaryotic Titin. Even larger proteins may occur in the rapidly expanding set of metagenome-derived sequences, but their existence may be obscured by assembly fragmentation. Here, we leverage genome curation to complete metagenome-derived sequences that encode predicted proteins of up to 85,804 aa. Overall, the findings illuminate a huge knowledge gap related to giant proteins. Although predicted proteins of >30,000 aa occur in bacterial phyla such as Firmicutes and Actinobacteria , they are most common in ca. Omnitrophota, ultra small bacteria that adopt predatory lifestyles. All full length giant genes encode numerous transmembrane regions and most encode divergent secA DEAD helicase domains. In silico structural prediction of protein subregions was required to identify domains in unannotated protein segments, and revealed putative domains implicated in attachment and carbohydrate degradation. Many giant genes in new complete and near-complete Omnitrophota genomes occur in close proximity to genes homologous to type II secretion systems as well as carbohydrate import systems. This, in combination with the domain content, suggests that many bacterial giant proteins enable prey adhesion and cell wall digestion during bacterial predation.