References
- (Berger et al., 1996): A. Berger, A. Della Pietra, and J. Della Pietra. A maximum entropy approach to natural language processing. Computational Linguistics, pp.39-71, No.1, Vol.22, 1996.
- (Lafferty et al., 2001): J. Lafferty, A. McCallum, and F. Pereira. Conditional random fields: Probabilistic models for segmenting and labeling sequence data. In the proceedings of International Conference on Machine Learning (ICML), pp.282-289, 2001.
- (Liu and Nocedal, 1989): D. Liu and J. Nocedal. On the limited memory BFGS method for large-scale optimization. Mathematical Programming, pp.503-528, Vol.45, 1989.
- (Sha and Pereira, 2003): F. Sha and F. Pereira. Shallow parsing with conditional random fields. In the proceedings of Human Language Technology/North American chapter of the Association for Computational Linguistics annual meeting (HLT/NAACL), 2003.
Acknowledgements
We would like to thank professor Tu-Bao Ho for providing us Penn Treebank data for evaluation.
We would like to thank professor Jorge Nocedal, Department of Electrical and Computer Engineering, School of Engineering and Applied Science, Northwestern University, for providing L-BFGS FORTRAN source code. www.ece.northwestern.edu/~nocedal
The C version of L-BFGS used in this project is borrowed from CRF++ project developed by Taku Kudo (www.chasen.org/~taku/software/CRF++). We would like to thank him for his open source project.
A part of this project, the training section (e.g., the computation of log-likelihood function and its gradient vector), is based on the Java source code of CRF project developed by professor Sunita Sarawagi, KR School of Information Technology, IIT Bombay. We would like to thank professor Sunita Sarawagi for sharing her CRF package and answering related question. www.it.iitb.ac.in/~sunita
We would like to thank Sourceforge.net for hosting this project.