The SSH Training Discovery Toolkit provides an inventory of training materials relevant for the Social Sciences and Humanities.

Use the search bar to discover materials or browse through the collections. The filters will help you identify your area of interest.

 

Wikiwho

Source icon
Source

The core functionality of wikiwho is to parse the complete set of all historical revisions (versions) of a wikipedia article in order to find out who wrote and/or removed which exact text at what point in time. this means that given a specific revision of an article (e.g., the current one) wikiwho can determine for each word and special character which user first introduced that word and if and how it was deleted/reintroduced afterwards. this functionality is not offered by wikipedia as such and wikiwho was shown to perform this task with very high accuracy (~95%) and very efficiently, being the only tool that has been scientifically proven to perform this task that well.

Intended audience
Formats
Language
Curated topics

Items

Title Description Collections
wikiwho api.

You can get word/token-wise information from which revision what content originated (and thereby which editor originally authored the word) as well as all changes a token was ever subject to.

Training Discovery Toolkit
Wikiwho source code

The original code plus some variants that contain extensions, especially a new function extracting relations between editors. Note that extended versions might include additional computational steps that can lead to higher runtimes than the original. 

Training Discovery Toolkit