Characterization of the argentinian web

Gabriel Tolosa, Fernando Bordignon, Ricardo Baeza-Yates, Carlos Castillo

Research output: Contribution to journalArticle

11 Citations (Scopus)

Abstract

This article presents the results of research on the characterization of the Argentinian web domain over a sample of almost 10 million web pages from 150.000 sites collected in the early 2006. Particularly, we have studied page contents, link structure and technologies used in the construction of the sites. The results are consistent with earlier research on other national web domains, where the same analysis methodology has been used. This study reveals a number of interesting facts: To begin with, there is a significant proportion (97.6%) of ".com.ar" domains with respect to other second level domains. As regards page contents, we have found a predominance of terms related to commercial activity. However, terms found in site names, extracted from their URLs, are mostly related to tourism. A large proportion of the pages (55%) do not have inbound links coming from other sites in the ".ar" domain while a 30% do not have outbound links. 72% of the pages have been created or modified in the last year, which indicates that the Argentinian web space is growing quickly. As for technologies, 48% of the pages from the sample are static and 52% dynamic, the latter being mostly built using free tools like PHP and Perl. Besides, 76% of the sites are hosted on servers geographically located in Argentina. These two facts show there is an important web-related technological development and communication infrastructure in Argentina.

Original languageEnglish
JournalCybermetrics
Volume11
Issue number1
Publication statusPublished - 1 Dec 2007
Externally publishedYes

Fingerprint

Argentina
technical development
Tourism
infrastructure
communication
methodology

Keywords

  • Argentinian web domain
  • Link analysis
  • Web characterization
  • Web Size Projection
  • Webometrics

ASJC Scopus subject areas

  • Library and Information Sciences

Cite this

Tolosa, G., Bordignon, F., Baeza-Yates, R., & Castillo, C. (2007). Characterization of the argentinian web. Cybermetrics, 11(1).

Characterization of the argentinian web. / Tolosa, Gabriel; Bordignon, Fernando; Baeza-Yates, Ricardo; Castillo, Carlos.

In: Cybermetrics, Vol. 11, No. 1, 01.12.2007.

Research output: Contribution to journalArticle

Tolosa, G, Bordignon, F, Baeza-Yates, R & Castillo, C 2007, 'Characterization of the argentinian web', Cybermetrics, vol. 11, no. 1.
Tolosa G, Bordignon F, Baeza-Yates R, Castillo C. Characterization of the argentinian web. Cybermetrics. 2007 Dec 1;11(1).
Tolosa, Gabriel ; Bordignon, Fernando ; Baeza-Yates, Ricardo ; Castillo, Carlos. / Characterization of the argentinian web. In: Cybermetrics. 2007 ; Vol. 11, No. 1.
@article{53b9c75fffbd492daadf0b11814b605d,
title = "Characterization of the argentinian web",
abstract = "This article presents the results of research on the characterization of the Argentinian web domain over a sample of almost 10 million web pages from 150.000 sites collected in the early 2006. Particularly, we have studied page contents, link structure and technologies used in the construction of the sites. The results are consistent with earlier research on other national web domains, where the same analysis methodology has been used. This study reveals a number of interesting facts: To begin with, there is a significant proportion (97.6{\%}) of {"}.com.ar{"} domains with respect to other second level domains. As regards page contents, we have found a predominance of terms related to commercial activity. However, terms found in site names, extracted from their URLs, are mostly related to tourism. A large proportion of the pages (55{\%}) do not have inbound links coming from other sites in the {"}.ar{"} domain while a 30{\%} do not have outbound links. 72{\%} of the pages have been created or modified in the last year, which indicates that the Argentinian web space is growing quickly. As for technologies, 48{\%} of the pages from the sample are static and 52{\%} dynamic, the latter being mostly built using free tools like PHP and Perl. Besides, 76{\%} of the sites are hosted on servers geographically located in Argentina. These two facts show there is an important web-related technological development and communication infrastructure in Argentina.",
keywords = "Argentinian web domain, Link analysis, Web characterization, Web Size Projection, Webometrics",
author = "Gabriel Tolosa and Fernando Bordignon and Ricardo Baeza-Yates and Carlos Castillo",
year = "2007",
month = "12",
day = "1",
language = "English",
volume = "11",
journal = "Cybermetrics",
issn = "1137-5019",
publisher = "Centro de Informacion y Documentacion Cientifica",
number = "1",

}

TY - JOUR

T1 - Characterization of the argentinian web

AU - Tolosa, Gabriel

AU - Bordignon, Fernando

AU - Baeza-Yates, Ricardo

AU - Castillo, Carlos

PY - 2007/12/1

Y1 - 2007/12/1

N2 - This article presents the results of research on the characterization of the Argentinian web domain over a sample of almost 10 million web pages from 150.000 sites collected in the early 2006. Particularly, we have studied page contents, link structure and technologies used in the construction of the sites. The results are consistent with earlier research on other national web domains, where the same analysis methodology has been used. This study reveals a number of interesting facts: To begin with, there is a significant proportion (97.6%) of ".com.ar" domains with respect to other second level domains. As regards page contents, we have found a predominance of terms related to commercial activity. However, terms found in site names, extracted from their URLs, are mostly related to tourism. A large proportion of the pages (55%) do not have inbound links coming from other sites in the ".ar" domain while a 30% do not have outbound links. 72% of the pages have been created or modified in the last year, which indicates that the Argentinian web space is growing quickly. As for technologies, 48% of the pages from the sample are static and 52% dynamic, the latter being mostly built using free tools like PHP and Perl. Besides, 76% of the sites are hosted on servers geographically located in Argentina. These two facts show there is an important web-related technological development and communication infrastructure in Argentina.

AB - This article presents the results of research on the characterization of the Argentinian web domain over a sample of almost 10 million web pages from 150.000 sites collected in the early 2006. Particularly, we have studied page contents, link structure and technologies used in the construction of the sites. The results are consistent with earlier research on other national web domains, where the same analysis methodology has been used. This study reveals a number of interesting facts: To begin with, there is a significant proportion (97.6%) of ".com.ar" domains with respect to other second level domains. As regards page contents, we have found a predominance of terms related to commercial activity. However, terms found in site names, extracted from their URLs, are mostly related to tourism. A large proportion of the pages (55%) do not have inbound links coming from other sites in the ".ar" domain while a 30% do not have outbound links. 72% of the pages have been created or modified in the last year, which indicates that the Argentinian web space is growing quickly. As for technologies, 48% of the pages from the sample are static and 52% dynamic, the latter being mostly built using free tools like PHP and Perl. Besides, 76% of the sites are hosted on servers geographically located in Argentina. These two facts show there is an important web-related technological development and communication infrastructure in Argentina.

KW - Argentinian web domain

KW - Link analysis

KW - Web characterization

KW - Web Size Projection

KW - Webometrics

UR - http://www.scopus.com/inward/record.url?scp=45949093531&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=45949093531&partnerID=8YFLogxK

M3 - Article

AN - SCOPUS:45949093531

VL - 11

JO - Cybermetrics

JF - Cybermetrics

SN - 1137-5019

IS - 1

ER -