Link Ripper
Capture all internal links and/or outlinks from a page.
Instructions
Enter one or more URLs, choose internal links and/or outlinks. Internal links are links within the same domain, external links are links outside of the target domain.
Sample project
Capture external links from the Wikipedia article 'Climate Change'. Input article URL (
http://en.wikipedia.org/wiki/Climate_change), select 'outlinks' and rip. After further cleaning, with links to other Wikipedia and Wikimedia domains removed, the list appears as follows:
http://dx.doi.org/10.1038%2Fnature05699
http://dx.doi.org/10.1016%2FS0009-2541%2899%2900084-4
http://volcanoes.usgs.gov/Hazards/What/VolGas/volgas.html
http://www.virtualcentre.org/en/library/key_pub/longshad/A0701E00.htm
http://www.guardian.co.uk/environment/2008/may/12/climatechange.carbonemissions
etc.
The WebsiteThe Website Issues with the object of study Investigations into Websites have been dominated by user and "eyeball studies," where attempts at a navigation poet...
Cross Lingual Art Spaces On WikipediaCross lingual Art Spaces on Wikipedia Sub group members: Sangeet Kumar, Garance Coggins, Sarah McMonagle, Stephan Schlögl, Han Teng Liao, Michael Stevenson, Feder...
Dmi ProtocolsProtocols devised by the DMI This page is being replaced gradually by our new research protocols and methods page. Hyperlink Analysis * Perfom an issue craw...
Emaps Amsterdam Sprint 2014 Project 4 Uses And Users Of Vulnerability IndexesUses and Users of Climate Change Vulnerability Indices: Mapping the Reputation of Indices in Climate Change Adaptation Spaces Issue experts Sönke Kreft, Matthew ...
Summer School 2007Digital Methods Summer School 2007: New Objects of Study 2010 2009 2008 2007 How does one do research online? What are the new objects of study, and how do ...
Test HomeDMI Tools Digital Methods Project Overview FAQ Tag Cloud Introduction The Digital Methods Initiative is a contribution to doing research into the "nati...
Tool Harvester How ToInput text in the harvester to extract URLs. Tip: On a website, view source. Copy and paste source code into harvester in order to extract the URLs (or embedded l...