John Pestian, PhD, and colleagues have used natural language processing to create the Cincinnati Pediatric Corpus (CPC), a collection of 600,000 words of HIPAA-anonymized clinical data approved for release by the Institutional Review Board of Cincinnati Children's Hospital Medical Center. [more]