Wiki143:WikiProject Wikidemia/Quant/Arch

From Wikipedia, the free encyclopedia
Revision as of 18:12, 26 June 2006 by imported>Erik Garrison
(diff) ← Previous revision | Latest revision (diff) | Newer revision → (diff)
Jump to navigation Jump to search

Parser

  • This converts the zipped xml database dumps into csv files with file specification:....

Stats

  • csv files of header information can be read into Statistical software packages R and Stata

Analysis

Figure Production

Table Production

Data Anomalies

In the Indonesian Wikipedia dump occasionally usernames appear in the <ip> tag (e.g. user:Vyasa). These appear to be localized to 2003. It is not clear why this occurs.