Where academic tradition
meets the exciting future

Incorporating External Information in Bayesian Classifiers Via Linear Feature Transformations

Tapio Pahikkala, Jorma Boberg, Aleksandr Mylläri, Tapio Salakoski, Incorporating External Information in Bayesian Classifiers Via Linear Feature Transformations. In: Tapio Salakoski, Filip Ginter, Sampo Pyysalo, Tapio Pahikkala (Eds.), Advances in Natural Language Processing, 5th International Conference on NLP, FinTAL 2006 Turku, Finland, August 23-25, 2006 Proceedings , Lecture Notes in Computer Science 4139/2006, 399–410, Springer Berlin / Heidelberg, 2006.

Abstract:

Naive Bayes classifier is a frequently used method in various natural language processing tasks. Inspired by a modified version of the method called the flexible Bayes classifier, we explore the use of linear feature transformations together with the Bayesian classifiers, because it provides us an elegant way to endow the classifier with an external information that is relevant to the task. While the flexible Bayes classifier is based on the idea of using kernel density estimation to obtain the class conditional probabilities of continuously valued attributes, we use the linear transformations to smooth the feature frequency counts of discrete valued attributes. We evaluate the method on the context sensitive spelling error correction problem using the Reuters corpus. For this particular task, we define a positional feature transformation and a word feature transformation that take advantage of the positional information of the context words and the part-of-speech information of words, respectively. Our experimental results show that the performance of the Bayesian classifiers in the natural language disambiguation tasks can be improved with the proposed transformations and that the incorporation of external information via the linear feature transformations is a promising research direction.

Files:

Full publication in PDF-format

BibTeX entry:

@INPROCEEDINGS{inpPaBoMySa06a,
  title = {Incorporating External Information in Bayesian Classifiers Via Linear Feature Transformations},
  booktitle = {Advances in Natural Language Processing, 5th International Conference on NLP, FinTAL 2006 Turku, Finland, August 23-25, 2006 Proceedings },
  author = {Pahikkala, Tapio and Boberg, Jorma and Mylläri, Aleksandr and Salakoski, Tapio},
  volume = {4139/2006},
  series = {Lecture Notes in Computer Science},
  editor = {Salakoski, Tapio and Ginter, Filip and Pyysalo, Sampo and Pahikkala, Tapio},
  publisher = {Springer Berlin / Heidelberg},
  pages = {399–410},
  year = {2006},
  keywords = {Machine learning, Naive Bayes, Flexible Bayes, Linear feature transformation, Disambiguation},
}

Belongs to TUCS Research Unit(s): Turku BioNLP Group

Publication Forum rating of this publication: level 1

Edit publication