The discrepancy between chronological age and estimated brain age, known as the brain age gap, may serve as a biomarker to reveal brain development and neuropsychiatric problems. This has motivated many studies focusing on the accurate estimation of brain age using different features and models, of which the generalizability is yet to be tested. Our recent study has demonstrated that conventional machine learning models can achieve high accuracy on brain age prediction during development using only a small set of selected features from multimodal brain imaging data. In the current study, we tested the replicability of various brain age models on the Adolescent Brain Cognitive Development (ABCD) cohort. We proposed a new refined model to improve the robustness of brain age prediction. The direct replication test for existing brain age models derived from the age range of 8-22 years onto the ABCD participants at baseline (9 to 10 years old) and year-two follow-up (11 to 12 years old) indicate that pre-trained models could capture the overall mean age failed precisely estimating brain age variation within a narrow range. The refined model, which combined broad prediction of the pre-trained model and granular information with the narrow age range, achieved the best performance with a mean absolute error of 0.49 and 0.48 years on the baseline and year-two data, respectively. The brain age gap yielded by the refined model showed significant associations with the participants' information processing speed and verbal comprehension ability on baseline data.