Autor(es): Miranda, João ; Gomes, Daniel
Data: 2009
Identificador Persistente: http://hdl.handle.net/10400.26/468
Origem: FCT – Computação Científica Nacional FCCN
Assunto(s): Portuguese web; Web characterization
Autor(es): Miranda, João ; Gomes, Daniel
Data: 2009
Identificador Persistente: http://hdl.handle.net/10400.26/468
Origem: FCT – Computação Científica Nacional FCCN
Assunto(s): Portuguese web; Web characterization
This study presents an updated characterization of the Portuguese Web derived from a crawl of 48 million contents belonging to all media types (2.5 TB of data), performed in March, 2008. The resulting data was analyzed to characterize contents, sites and domains. This study was performed within the scope of the Portuguese Web Archive.
POSC/EU, UMIC