Current Proceedings on Technology

Current Proceedings on Technology

Effect of Tagged-Terms on Web Page Classification Accuracy

Yazarlar: Havva Esin Unal, Selma Ayse Ozel, Ilker Unal

Cilt 3 , Sayı - , 2013 , Sayfalar -

Konular:-

Anahtar Kelimeler:Web page classification,Web mining,HTML tags,Accuracy

Özet: The Web is a large collection of heterogeneous documents growing daily. Related to this increased data, it is becoming difficult to effectively reach to useful information from this environment. For this purpose, an automatic Web page classification mechanism is needed to extract the documents in desired topics. In earlier studies on Web page classification, it has been concluded that using HTML tags affects classification accuracy positively. In this study, our aim is to show the effect of each HTML tag separately on classification accuracy of several classifiers. To show the effect of each tag on classification accuracy, HTML tags and terms in each tag are used as separate features. We observed that different tag sets give high classification accuracy for different datasets, however, using features extracted from anchor tags provides higher classification performance in the majority of the datasets.


ATIFLAR
Atıf Yapan Eserler
Henüz Atıf Yapılmamıştır

KAYNAK GÖSTER
BibTex
KOPYALA
@article{2013, title={Effect of Tagged-Terms on Web Page Classification Accuracy}, volume={3}, number={0}, publisher={Current Proceedings on Technology }, author={Havva Esin Unal, Selma Ayse Ozel, Ilker Unal}, year={2013} }
APA
KOPYALA
Havva Esin Unal, Selma Ayse Ozel, Ilker Unal. (2013). Effect of Tagged-Terms on Web Page Classification Accuracy (Vol. 3). Vol. 3. Current Proceedings on Technology .
MLA
KOPYALA
Havva Esin Unal, Selma Ayse Ozel, Ilker Unal. Effect of Tagged-Terms on Web Page Classification Accuracy. no. 0, Current Proceedings on Technology , 2013.