A method for the massive extraction of syndication channels
DOI:
https://doi.org/10.54886/scire.v23i1.4300
Keywords:
data mining, scraping, web crawler, content syndication, RSS, feeds
Abstract
One obstacle to studying the information output of syndication channels is gathering a sufficient number of sources from the same domain, subject or area of knowledge to compile a sample. This stems from the dispersion of information sources on the Web, the researcher's difficulty in knowing all the available resources, and the difficulty of locating and extracting the syndication channel links in every relevant website or Internet resource that is discovered. This article discusses a method to extract and compile syndication channels through the composition of seeds for a web crawler, its configuration, and the subsequent processing of the links obtained.
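The link-extraction step mentioned in the abstract can be illustrated with a minimal sketch. It is not the article's own procedure, which is not reproduced on this page; it assumes that syndication channels are advertised through standard `<link rel="alternate">` autodiscovery tags in the page head. The names `FeedLinkParser` and `extract_feed_links`, and the example URLs, are hypothetical.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin

# MIME types commonly used to advertise RSS and Atom channels (assumption:
# the pages being crawled follow the usual autodiscovery convention).
FEED_TYPES = {"application/rss+xml", "application/atom+xml"}


class FeedLinkParser(HTMLParser):
    """Collects feed URLs declared in <link rel="alternate"> tags."""

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url
        self.feeds = []

    def handle_starttag(self, tag, attrs):
        if tag != "link":
            return
        # Attribute values may be None (e.g. a bare attribute), so normalise.
        a = {name: (value or "") for name, value in attrs}
        rel = a.get("rel", "").lower().split()
        mime = a.get("type", "").lower().strip()
        href = a.get("href", "")
        if "alternate" in rel and mime in FEED_TYPES and href:
            # Resolve relative links against the page URL and skip duplicates.
            url = urljoin(self.base_url, href)
            if url not in self.feeds:
                self.feeds.append(url)


def extract_feed_links(html, base_url):
    """Return the feed URLs advertised by one HTML page, in document order."""
    parser = FeedLinkParser(base_url)
    parser.feed(html)
    return parser.feeds
```

In a crawl, a function of this kind would be applied to each page reached from the seeds, and the resulting URLs would then be deduplicated and filtered across the whole sample.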
License
Copyright (c) 2017 Authors retain their copyright, but transfer the exploitation rights (reproduction, distribution, public communication and transformation) to the journal in a non-exclusive way and guarantee the right to the first publication of their work to the journal, which will be simultaneously subject to the CC BY-NC-ND license. Authors take full personal responsibility for complying with all the appropriate ethical codes and laws, and for obtaining all the necessary copyright permissions regarding their articles. Institutional and self-archiving is allowed and encouraged.
This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.