Crash modification factor (CMF) is an effectiveness measure of safety countermeasures. It is widely used by state agencies to evaluate and prioritize various safety improvement projects. The Federal Highway Administration (FHWA) CMF Clearinghouse provides CMFs for a broad range of countermeasures, but still, the existing CMFs often cannot meet the needs for characterizing the safety impacts of countermeasures in new scenarios. Developing CMFs, meanwhile, is costly, time-consuming, and requires extensive data collection. A more cost-effective way to provide preliminary CMF estimations is needed. To address this need, this project will aim to develop a low-cost and easily extendable data-driven framework for CMF predictions. This framework will perform data mining on existing CMF records in the FHWA CMF Clearinghouse. To tackle the heterogeneity of data, interdisciplinary techniques to maintain model compatibility will be created and used. The project also will integrate multiple machine-learning models to learn the complex hidden relationships between different safety countermeasure scenarios. Finally, the proposed framework will be trained against the CMF Clearinghouse data and perform comprehensive evaluations.