We used various digital methods to study the evolution of the concept open data, namely Google scraping, list-building and Wikipedia medium-specific features.
RQ1: To understand the resonance of open data amongst CSOs, we initially defined the pool of CSOs by building a list of eight different sectors of CSOs. We first aggregated keywords from high ranking and high volume petition websites (e.g. avaz.org and change.org). From these petitions, a list of top topics were identified based on the topics listed - in the case of change.org the topics were listed while avaz.org had a questionnaire that asked website users to vote on the most important topics. Lastly, the top petitions were manually reviewed and sorted. The topics formed the starting point in defining the thematic classifications needed to identify, and organise lists of CSOs. The categories were developed in a three-step process. The first categories were identified to be Human Rights, Environment, Health, Government and Politics. The topics were cleaned to reflect larger issues in which associated organisations with an online presence could be identified. The CSOs were identified, and their websites were classified into categories. This led to a second category classification, which resulted in: Human Rights, Environment, Health, Transparency and Financial Transparency. A last revision of the key topics, the CSO and the categories led to one last re-classification of categories, to ensure all major topics were properly represented in our CSO list. Global Health Development, Climate Change, Human Rights, International Transparency, Financial Transparency, International Anti-War Organizations, International Anti-Nuclear Organizations, Media and Democracy formed our final list. Privacy and Global Health were also part of this original set, however technical difficulties meant that we were unable to complete these areas during the allocated and time. These categories were then put through the digital method approach of list-building through URL links (Rogers 5) from editorial sources, such as Wikipedia lists, and the memberships of related networks and coalitions. Expertise from the team of researchers, such as Albana Shala from Free Press Unlimited, was used to curate the Media Development list. These lists were then inserted into DMIs Google Scraper tool, for queries into the number of times these organisations mentioned open data. The results for each can be found in Table 1.0. The data we collected from each website was visualised as the last step of this research. Table 1.0| Civil Society Area | Method of Acquiring List | Number of Websites |
| Climate Change | Triangulation of Climate Action Network and TckTckTck lists. | 74 |
| International Transparency | List curated by the Sunlight Foundation | 544 |
| Financial Transparency | Membership list of the Financial Transparency Coalition | 140 |
| Anti-War | 16 | |
| Anti-nuclear | 20 | |
| Media Development | Triangulation of Wikipedia, CIMA, Freedom of the Press Foundation, and Alban Shalas expertise from Free Press Unlimited | 92 |
| Democracy | Triangulation of Wikipedia, Democracy Promotion Organisations in the United States, UNDP Guide on Civil Society organisations working on Democratic Governance | 76 |
| Human Rights | Triangulation of Wikipedia, Human Rights Web, Crisis Connection, Human Trafficking Search and University of Minnesota Human Rights Library | 29 |
| Total Number of Websites | 991 |
Media Development was another high resulting data set with 3,693 mentions out of 92 websites (see the results in CSO lists figure 2.8). Only 26 organisations did not mention the term, meaning about 68% of the organisations under our study mentioned the term. Within the pool of websites, open data has a relatively high level of resonance. News organisations such as Deutsche Welle and Internews with media development programs were also amongst the top resonating websites. Advocacy media organisations such as Global Voices, ICFJ and Article 19 were amongst the top ranked websites to mention the term. (3,480 mentions; 81 sites/26 have zero mentions)
Philanthropic organisations did not resonate at significant numbers, with only 214 mentions within a pool of 35 websites. 16 out of the 36 websites we queried mentioned the term, meaning about 45% of our sample pool worked, or used the concept of open data. At the top of these lists were big foundations like the The Wellcome Trust, Macarthur Foundation, and the Getty Trust. These organisations however, do not maintain high levels of content regarding their projects, as such the level of digital footprints they maintain may explain its low level of resonance with open data. (214 mentions; 35 sites/19 have zero mentions)
Climate Change organisations had 516 mentions of open data, a relatively small number considering its pool of 75 websites. Over half of this URL set had zero mentions of open data. (516 mentions; 75 sites/43 have zero mentions)
Democracy oriented organisations had a fair amount of open data mentions, however most institutions were either academic institutions, or political foundations. (851 mentions; 66 sites/40 have zero mentions)
Anti-Nuclear organisations had about 101 mentions of open data, which came only from the greenpeace.org website. (101 mentions; 21 sites/18 have zero mentions)
Wikipedia 'open data' language versions: date of creation, number of edits and length of page. Retrieved: 17 Jan 2015.
RQ2-B: Although some of the TOCs terminology of the open data articles varies across the different Wikipedia language versions, the topics are similar. With the exception of the French TOC, no particular country-specific or language-specific topics are highlighted. The French TOC stands out as the most detailed by far, corresponding with the articles length. The shortest TOCs (such as Arabic, Hindi and Thai) are limited to an overview or definition of the term, links and references.

Wikipedia 'open data' language versions: Tables of Content. Retrieved: 17 Jan 2015.
RQ2-C: Analyzing the English version of the open data article, using the Contropedia tool, no apparent controversial issues nor major edits were evident. As Contropedia is mostly used for detecting controversies within an entry rather than cross-lingual entry comparison, and as no controversies were apparent on the English version, no additional language versions (which have significantly less edits) were examined using Contropedia.

Wikipedia 'open data' English version: Contropedia analysis. Retrieved: 17 Jan 2015.
RQ2-D: The most mentioned topics across all Wikipedia language versions of the open data article, are transparency and democracy, as well as public service. The English version is the only to refer to all topics as well as privacy, while the French version, although the largest in size, does not address the issue of privacy, but rather focuses on civil, public and economic advantages of open data. Efficiency and waste are only present in the Italian version, while privacy is addressed in the English, German and Serbian versions. The Spanish version focuses on open source, free software and open knowledge movements, while the Italian and Ukrainian pages refers mainly to open government. Some of the language versions, such as the Portuguese and Serbian pages, present arguments in favor and against open data. Six of the language versions (namely Catalan, Swedish, Thai, Hindi, Khmer and Tamil) do not include any of the above-mentioned terminology.

Wikipedia 'open data' language versions: issue mapping. Retrieved: 17 Jan 2015.
RQ2-E: The twenty-one Wikipedia open data language versions refer to 192 sources on total. The French version has the most references (59), followed by the English version (30) and the Japanese page (15). The Dutch, Catalan and Thai versions have only a single reference each, while the Arabic, Bulgarian, Hindi and Tamil versions have no references at all. No references are common to all language versions. Only one source is common to six of the language versions, namely the Science Commons website. The Open Definition website was common to five of the language versions. Five sources are common to four languages, while nine sources are shared among three language versions and eight sources are common to two language versions. As 168 of all sources appear only on one language version, most references are thus language-version specific. However, the language or origin of the reference sources does not necessarily correspond with the language version, with many pages referencing English-language sources, as may be expected, while the Japanese and Khmer pages, for example, reference Italian sources.

Wikipedia 'open data' language versions: references (sources) network. Retrieved: 17 Jan 2015.
RQ2-F: The wiki-links (internal outbound Wikipedia links) are composed of 940 nodes. 65% of the nodes are related to only one language, with the main links being Open Access, linked by 11 languages, followed by Open Source, linked by 10 languages, and then Internet, Open Content and Tim Berners-Lee. Thus, the most linked pages are related to open topics, such open government, open source and open access, as well as copyright- and licensing-related topics. Most pages are linked by one language, with the French page linking to 193 unique pages, followed by Russian (39), English (38), Japanese (29), German (23), Spanish (20) and Italian (15). However, many language versions of the same pages were not inner-linked within Wikipedia. Using Google Translate and Google Refine, we then reconciled those pages to create lists of unique pages. For most language versions, the unique links are mainly about national initiatives and institutions, yet some languages present interesting features. The Arabic version is the only one linking to censorship and privacy, while the Catalan has only one unique link to an open-data economic source. The French version, which is the largest in content, has the most unique links, but the majority are related to generic concepts.

Wikipedia 'open data' language versions: wiki-links network. Retrieved: 17 Jan 2015.

Wikipedia 'open data' language versions: wiki-links word cloud. Retrieved: 17 Jan 2015.
The back wiki-links (internal inbound Wikipedia links) are composed of 567 nodes. 56% of the nodes are related to only one language. The main links are Open Knowledge, Creative Commons and Linked Data, linked to 8 language versions respectively. The English version is the most linked-to page, followed by the French version. Some of the language versions also have back wiki-links to the English version of the Open Data entry.

Wikipedia 'open data' language versions: back wiki-links network. Retrieved: 17 Jan 2015.

Wikipedia 'open data' language versions: back wiki-links word cloud. Retrieved: 17 Jan 2015.
Open data is a rapidly growing topic of interest on both global and national levels, related to various concepts, as is well reflected in the civil society agenda and the open data Wikipedia languages spheres. Of the top 10 ranked countries in The Global Open Data Index, only seven countries have a related language-version of the open data article, with countries such as Finland, Denmark and Norway, where open data seems to be an issue of public importance, having no language-specific versions, while Cambodia, which is ranked as 76, has an open data article in Khmer. Thus, there is no necessary correlation between the national significance of open data and its presence on Wikipedia.
In the civil society agenda, the most predominant resonance was found amongst international transparency and media development organisations. However, such open-data related terminology and arguments are not as present on the various Wikipedia language versions, focused mainly on democracy and transparency. The wiki-links and the back wiki-links also indicate a clear relation between open data and the free-software and open-source movements, as most inner-wiki-links (links to pages within Wikipedia) are related to these topics. Within the civil society agenda, beyond the top resonating organisations to mention open data, a pattern evolved in relevance to open data. Through an examination of the nature of the organisations, and their online content, three types of organisations were identified, namely organisations promoting open data as a concept, organisations for whom open data is instrumental to achieving some other goal and organisations with overall low resonance with open data. It was expected that organisations that use open data in itself would scrape high queries. International transparency and media development organizations had the most mentions, and were most closely aligned with the ethos of open data, as most were digital rights organizations and advocacy initiatives related to access to information. The top ranking websites however, seemed to belong to government affiliated organisations such as data.gov.uk. Philanthropic organisations returned surprising results, with some of the lowest mentions of open data, possibly due to their websites often containing very little content regarding the specifics of their funded programs. The power of open data to appear amongst organisations who would use open data as instrumental towards another goal would perhaps underline the power of open data to effect change for significant social causes. Democracy promotion organisations, mainly academic institutions and politically-affiliated organizations, had the highest resonance with open data, while within the climate change data set, over half of the websites queried had no mention of open data. Human rights organisations had low resonance, with those organisations that maintained the most traction were often government affiliated, while NGOs had less use of the term. We surmised that these NGOs would rely on data related to information that is often hard to become aggregated and distributed within open data sources, such as demographics related to executions, torture, and prisoners. This finding ultimately speaks to the nature of information and research needed by human rights organisations, that open data has yet to enter into. The most surprising results belonged to anti-war organisations, who had 1 overall mention of open data. Significant results came from Facebook pages, which the Google Scraper could not query for mentions, and were thus omitted from our findings. However, the numbers in these results do not necessarily represent the traction of open data within this civil society sector, as those participating in this movement may not be organisations, but participants in protests, campaigns, and events which dont maintain digital footprints of their work that fit within the scope of this study. Anti-nuclear organisations only found traction with Greenpeace, which already was part of our climate change data set, so it is unclear if this result is an accurate reflection of this civil society sector. Thus, in the civil society agenda, open data finds the most significant resonance within areas related to transparency, and media development, corresponding with the Wikipedia language spheres, in which the issue finds traction in a similar area, as a digital commons topic.Given the limited scope of this study, further research into the ways each website utilises the concept of open data would be worthwhile. At the moment, this study was not able to search the depth in which some of these civil society organisations use open data. Furthermore, the list building process for the civil society organisations may have excluded a number of projects, and networks that did not maintain strong digital footprints. Additionally, the limitations of time, and difficulties with using the Google Scraper tool with large data sets meant that significant sets of data (such as our queries for open data amongst privacy) had to be omitted. Thus, we acknowledge the limitations of digital methods in capturing the work of civil society organisations, especially given the fluidity and unconventional structures within civil society, as they may exist as events, protests, or campaigns without associated websites.
Copyright © by the contributing authors. All material on this collaboration platform is the property of the contributing authors.