BACKGROUND: Cancer Research Network (CRN) sites use administrative data to populate their Virtual Data Warehouse (VDW). However, information on the validity of VDW chemotherapy data is limited. The purpose of this study was to assess the validity of VDW chemotherapy data.

METHODS: This was a retrospective cohort study of women ≥18 years of age with incident, invasive breast cancer diagnosed between January 1999 and December 2007. Pharmacy and procedure chemotherapy data were extracted from each site's VDW. Random samples of 50 patients, stratified on trastuzumab, anthracycline, and no chemotherapy exposure, were selected from each site for detailed chart abstraction. Weighted sensitivities and specificities of VDW data compared to abstracted data were calculated. Cumulative doses calculated from VDW data were compared to doses obtained from the medical chart review.

RESULTS: The cohort included 13,497 patients, of whom 6,456 (48%) were eligible for chart review. Patients in the sample (N=400) had a mean age of 65 years. Weighted sensitivities for trastuzumab, anthracyclines, and other chemotherapy were 95%, 97%, and 100%, respectively; specificities were 99%, 99%, and 93%, respectively; positive predictive values were 96%, 99%, and 55%, respectively; and negative predictive values were 99%, 96%, and 100%, respectively. Mean VDW doses of trastuzumab and anthracyclines were 873 mg and 386 mg, respectively, while mean abstracted doses were 1734 mg and 369 mg, respectively (R²=0.14, p<0.01 and R²=0.05, p=0.03, respectively).

CONCLUSIONS: Sensitivities and specificities for CRN chemotherapy VDW data were high, and VDW dosages were correlated with chart information.

IMPACT: The findings support the use of CRN data in evaluating chemotherapy exposures and related outcomes.
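As an illustration of the validity measures reported above, the following is a minimal Python sketch of how weighted sensitivity, specificity, and predictive values might be computed when chart abstraction is treated as the reference standard. The `Record` class, its field names, the `weighted_validity` function, and the example weights are all hypothetical; the abstract does not describe the study's actual weighting scheme, so inverse-probability sampling weights are assumed here.

```python
from dataclasses import dataclass
from typing import Dict, List, Optional


@dataclass
class Record:
    """One sampled patient (hypothetical structure, not from the study)."""
    vdw_exposed: bool    # exposure flagged in the VDW data
    chart_exposed: bool  # exposure found in chart abstraction (reference standard)
    weight: float        # assumed sampling weight (inverse of sampling probability)


def _ratio(numerator: float, denominator: float) -> Optional[float]:
    # Return None rather than dividing by zero when a cell total is empty.
    return numerator / denominator if denominator > 0 else None


def weighted_validity(records: List[Record]) -> Dict[str, Optional[float]]:
    """Sum weights into the four cells of a 2x2 table, then form the ratios."""
    tp = fp = fn = tn = 0.0
    for r in records:
        if r.vdw_exposed and r.chart_exposed:
            tp += r.weight  # VDW and chart agree on exposure
        elif r.vdw_exposed:
            fp += r.weight  # VDW flags exposure, chart does not
        elif r.chart_exposed:
            fn += r.weight  # chart shows exposure, VDW misses it
        else:
            tn += r.weight  # VDW and chart agree on no exposure
    return {
        "sensitivity": _ratio(tp, tp + fn),
        "specificity": _ratio(tn, tn + fp),
        "ppv": _ratio(tp, tp + fp),
        "npv": _ratio(tn, tn + fn),
    }


# Made-up example data, for illustration only.
sample = [
    Record(vdw_exposed=True, chart_exposed=True, weight=3.0),
    Record(vdw_exposed=True, chart_exposed=True, weight=5.0),
    Record(vdw_exposed=False, chart_exposed=True, weight=2.0),
    Record(vdw_exposed=True, chart_exposed=False, weight=1.0),
    Record(vdw_exposed=False, chart_exposed=False, weight=9.0),
]
metrics = weighted_validity(sample)
print(metrics)
```

In this sketch the weights let a stratified sample stand in for the full eligible cohort: strata sampled at a lower rate contribute more weight per patient. A low positive predictive value alongside high sensitivity and specificity, as reported for other chemotherapy, can arise when true exposure is uncommon in the weighted population.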