Using Bins to Empirically Estimate Term Weights for Text Categorization
by Carl Sable and Kenneth W. Church.

References

F. Jelinek. 1998. Statistical Methods for Speech Recognition. MIT Press.
T. Joachims. 1998. Text categorization with support vector machines: Learning with many relevant features. ECML.
S. Katz. 1996. Distribution of content words and phrases in text and language modeling.
D. Lewis. 1997. Reuters-21578 text categorization test collection, readme file (ver. 1.2).
D. Lewis. 1998. Naive (Bayes) at forty: The independence assumption in IR. ECML.
A. McCallum. 1996. Bow: A toolkit for statistical language modeling, text retrieval, classification, and clustering. 
C. Sable and V. Hatzivassiloglou. 2000. Text-based approaches for non-topical image categorization. International Journal of Digital Libraries.
C. Sable. 2000. Categorizing multimedia documents using associated text (thesis proposal). 
G. Salton and C. Buckley. 1988. Term weighting approaches in automatic text retrieval. Information Processing and Management.
G. Salton. 1989. Automatic Text Processing: The Transformation, Analysis, and Retrieval of Information by Computer. Addison-Wesley.
R. Schapire and Y. Singer. 2000. BoosTexter: A boosting-based system for text categorization. Machine Learning.
K. Umemura and K. W. Church. 2000. Empirical term weighting and expansion frequency. EMNLP/VLC 2000.
C. J. van Rijsbergen. 1979. Information Retrieval. Butterworths.
Y. Yang and X. Liu. 1999. A re-examination of text categorization methods. SIGIR 1999.
