Proceedings of the BioNLP Workshop on Linking Natural Language Processing and Biology at HLT-NAACL 06, page 81,
New York City, June 2006. c©2006 Association for Computational Linguistics
Procter and Gamble Keynote Speech:  
Mining biomedical texts for disease-related pathways 
 
 Andrey Rzhetsky  
 Columbia University 
 andrey.rzhetsky@dbmi.columbia.edu  
  
 
 
 
 
Abstract 
I will describe my collaborators' and my 
own effort to compile large models of 
molecular pathways in complex human 
disorders. 
The talk will address a number of interre-
lated questions: 
How to extract facts from texts at a large 
scale? 
How to assess the quality of the extracted 
facts? 
How to identify sets of conflicting or un-
reliable facts and to generate an internally 
consistent model? 
How to use the resulting pathway model 
for automated generation of biological 
hypotheses? 
 
81
