A method for the massive extraction of syndication channels

Authors

  • Manuel Blázquez Ochando Departamento de Biblioteconomía y Documentación de la Facultad de Ciencias de la Documentación de la Universidad Complutense de Madrid

DOI:

https://doi.org/10.54886/scire.v23i1.4300

Keywords:

data mining, scraping, web crawler, content syndication, RSS, feeds

Abstract

One of the problems for investigating the informative production of syndication channels is counting on the sufficient number of sources from the same domain, subject or area of knowledge, to compile a sample. This is a consequence of the dispersion of information sources on the Web; the researcher’s difficulty in knowing all the available resources; and the difficulty in extracting and locating the links of syndication channels in every relevant web site or Internet resource that is discovered. This article discusses the method to extract and compile syndication channels through the composition of seeds using a web crawler, and the configuration and subsequent processing of the obtained links.

Downloads

Download data is not yet available.

Published

2017-06-13

How to Cite

Ochando, M. B. (2017). A method for the massive extraction of syndication channels. Scire: Knowledge Representation and Organization (ISSNe 2340-7042; ISSN 1135-3716), 23(1), 39–45. https://doi.org/10.54886/scire.v23i1.4300

Issue

Section

Articles