Identifying which patients should undergo serologic screening for celiac disease (CD) may help diagnose patients who otherwise often experience diagnostic delays or remain undiagnosed. Using anonymized outpatient data from the electronic medical records of Maccabi Healthcare Services, we developed and evaluated a gradient boosted trees model to classify patients as at-risk for CD autoimmunity prior to first documented diagnosis or positive serum tissue transglutaminase (tTG-IgA). A train set of highly seropositive (tTG-IgA > 10X ULN) cases (n = 677) with likely CD and controls (n = 176,293) with no evidence of CD autoimmunity was used for model development. Input features included demographic information and commonly available laboratory results. The model was then evaluated for discriminative ability as measured by AUC on a distinct set of highly seropositive cases (n = 153) and controls (n = 41,087). Performance was estimated at one (AUC = 0.84), two (AUC = 0.81), three (AUC = 0.82) and four (AUC = 0.79) years prior to first documented evidence of disease. The model’s ability to distinguish cases of incident CD autoimmunity from controls shows promise as a potential clinical tool to identify patients at increased risk for having undiagnosed celiac disease in the community for serologic screening.