ARCHIVES
Research Article
An Exploratory Review on Data from Web Page Using Content Mining Methodology
SHARMA PREETI1
ICL INSTITUTE OF ENGINEERING AND TECHNOLOGY- Haryana, India.
Published Online: January-April 2022
Pages: 13-15
Cite this article
No DOIReferences
1. S. Baluja, “Browsing on smalls screens: Recasting Web-page segmentation in to an efficient machine learning
framework”, Proceedings of the 15th International Conference on World Wide Web, pp. 33–42, 2006.
2. M. Baroni, F. Chantree, A. Kilgarri, S. Sharoff, “Cleaneval: A competition for cleaning Web pages”, Proceedings of
the sixth International Conference on Language Resources and Evaluation, 2008
3. Cai, S. Yu, J.-R. Wen, and W.-Y. Ma. Vips: vision-based page segmentation algorithm. Technical report, Microsoft
Research, 2003.
4. H. F. Laender, B. A. Ribeiro-Neto, A. S. da Silva,and J. S. Teixeira. A brief survey of web data extraction tools.
SIGMOD Rec., 31(2):84-93, 2002.
5. Y. Yesilada, ―Web Page Segmentation: A Review,ǁ eMINE Technical Report Deliverable 0 (D0), 2011.
6. [6]. Y. Yesilada, ―Heuristics for Visual Elements of Web Pages,ǁ eMINE Technical Report Deliverable 1 (D1), 2011.
7. Zhao Xin-xin, Suo Hong-guang, Liu Yu-shu. Web Content Information Extraction Method Based on Tag Window.
Application Research of Computers. 2007,24(3).-144-145,180.
8. Pan Donghua, Qiu Shaogang. Web Page Content Extraction Method Based on Link Density and Statistic. The 4Th
International Conference.
9. F. R. Rahman, H. Alam and R. Hartono “Content Extraction from HTML Documents”
10. Wolfgang Reichl, Bob Carpenter, Jennifer Chu-Carroll, Wu Chou “Language Modeling for Content Extraction in
Human- Computer Dialogues”.
framework”, Proceedings of the 15th International Conference on World Wide Web, pp. 33–42, 2006.
2. M. Baroni, F. Chantree, A. Kilgarri, S. Sharoff, “Cleaneval: A competition for cleaning Web pages”, Proceedings of
the sixth International Conference on Language Resources and Evaluation, 2008
3. Cai, S. Yu, J.-R. Wen, and W.-Y. Ma. Vips: vision-based page segmentation algorithm. Technical report, Microsoft
Research, 2003.
4. H. F. Laender, B. A. Ribeiro-Neto, A. S. da Silva,and J. S. Teixeira. A brief survey of web data extraction tools.
SIGMOD Rec., 31(2):84-93, 2002.
5. Y. Yesilada, ―Web Page Segmentation: A Review,ǁ eMINE Technical Report Deliverable 0 (D0), 2011.
6. [6]. Y. Yesilada, ―Heuristics for Visual Elements of Web Pages,ǁ eMINE Technical Report Deliverable 1 (D1), 2011.
7. Zhao Xin-xin, Suo Hong-guang, Liu Yu-shu. Web Content Information Extraction Method Based on Tag Window.
Application Research of Computers. 2007,24(3).-144-145,180.
8. Pan Donghua, Qiu Shaogang. Web Page Content Extraction Method Based on Link Density and Statistic. The 4Th
International Conference.
9. F. R. Rahman, H. Alam and R. Hartono “Content Extraction from HTML Documents”
10. Wolfgang Reichl, Bob Carpenter, Jennifer Chu-Carroll, Wu Chou “Language Modeling for Content Extraction in
Human- Computer Dialogues”.
Related Articles
2022
Review on Detection and Rectification of Distorted Fingerprint
2022
Case Study of Student Information tracking System
2022
Investigation of Novel Algorithm for Compact Computational Time by Using Fuzzy Practice in Data Mining
2022
Investigation on Video Genus Recognition Using Gestures of the Viewer
2022
Gesture Recognition of the Viewer using Videos and Deep Learning
2022