DMI Tools


Amazon Book Explorer
Provides different analytics for Amazon.com's book search.
 
Bubble Lines
Input tags and values to produce relatively sized bubbles. Output is an SVG file.
 
Censorship Explorer
Check whether a URL is censored in a particular country by using proxies located around the world.
 
Colors For Data Scientists
Generate and refine palettes of optimally distinct colors. (by Médialab Sciences-Po)
 
Compare Lists
Compare two lists of URLs for their commonalities and differences.
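The comparison can be pictured as plain set arithmetic. The following is a minimal sketch, not the tool's own code; the function name `compare_lists` and the example URLs are hypothetical, and URLs are matched as exact strings without any normalization.

```python
def compare_lists(list_a, list_b):
    """Return the items common to both lists and those unique to each."""
    set_a, set_b = set(list_a), set(list_b)
    return {
        "common": sorted(set_a & set_b),   # in both lists
        "only_a": sorted(set_a - set_b),   # only in the first list
        "only_b": sorted(set_b - set_a),   # only in the second list
    }

# Hypothetical example URLs
a = ["http://example.org", "http://example.com"]
b = ["http://example.com", "http://example.net"]
result = compare_lists(a, b)
print(result["common"])
```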
 
Compare Networks Over Time
Compares Issue Crawler networks over time, and displays ranked actor lists. The over time module is best used in tandem with the Issue Crawler scheduler. The results may be plotted to line g...
 
Convert Issuecrawler to Navicrawler
Convert an Issuecrawler XML file into Navicrawler's WXSF file format. For Navicrawler, see http://webatlas.fr/wp/navicrawler/.
 
Deduplicate
Replicates the tags in a tag cloud by their value.
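As a rough illustration of replicating tags by value, the sketch below repeats each tag as many times as its value. It assumes the tag cloud is available as a tag-to-integer mapping; the function name `replicate_tags` and the sample tags are hypothetical, and the tool's actual input format may differ.

```python
def replicate_tags(tag_values):
    """Repeat each tag as many times as its (integer) value."""
    words = []
    for tag, value in tag_values.items():
        words.extend([tag] * value)
    return words

# Hypothetical tag cloud: tag -> value
cloud = {"climate": 3, "energy": 1}
print(" ".join(replicate_tags(cloud)))
```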
 
Disqus Comment Scraper
This tool scrapes threads and comments from websites implementing the Disqus commenting system.
 
Dorling Map Generator
Input tags and values to produce a Dorling Map (i.e. bubbles). Output is an SVG file.
 
Expand Tiny Urls
Expands URLs that have been shortened by tools like tinyurl.com or bit.ly. Such shortened URLs are often used in social media such as Twitter or Facebook.
 
Extract URLs
Extracts URLs from an Issuecrawler result file (.xml). Useful for retrieving starting points as well as a clean list of the actors in the network.
 
Geo Extraction
Extracts geographic locations from text.
 
Geo IP
Translates URLs or IP addresses into geographical locations.
 
Google Autocomplete
Retrieves autocomplete suggestions from Google.
 
Google Blog Search Scraper
Batch queries Google Blog Search. Query the resonance of a particular term, or a series of terms, in a set of blogs.
 
Google Image Scraper
Query images.google.com with one or more keywords, and/or use images.google.com to query specific sites for images.
 
Google News Scraper
 
Googlescraper (Lippmannian Device)
Batch queries Google. Query the resonance of a particular term, or a series of terms, in a set of Websites.
 
Harvester
Extract URLs from text, source code or search engine results. Produces a clean list of URLs.
 
Image Scraper
Scrape images from a single page.
 
Internet Archive Wayback Machine Link Ripper
Scrapes links from the Wayback Machine.
 
Internet Archive Wayback Machine Network Per Year
Enter a set of URLs and the archived versions closest to 1 July for a specific year are retrieved. Thereafter links are extracted and a network file is output.
 
Issue Discovery Tool
Enter URLs, and discover the most relevant words and phrases contained in them. One may also enter text, or an Issuecrawler result file (.xml). Based on Yahoo Term Extraction.
 
Issue Dramaturg
Enter up to 3 URLs as well as a key word. The Issuedramaturg queries Google for the key word, and shows the Pageranks of the URLs over time. The output is a graph of the Pagerank of the URLs...
 
Issue Geographer
Geo-locates the organizations on an Issue Crawler map, using whois information, and visualizes the organizations' registered locations on a geographical map.
 
Issuecrawler
Enter URLs and the Issue Crawler performs co-link analysis in one, two or three iterations, and outputs a cluster graph. The Issue Crawler also has modules for snowball crawling (up to 3 deg...
 
Itunes Store
 
Language Detection
Detects language for given URLs. The first 500 characters on the Web page(s) are extracted, and the language of each page is detected.
 
Like Scraper
Retrieves the number of likes on Facebook for a series of URLs.
 
Link Ripper
Capture all internal links and/or outlinks from a page.
 
Lippmannian Device
The Lippmannian device is named after Walter Lippmann, and provides a coarse means of showing actor partisanship.
 
Lippmannian Device To Gephi
This tool allows one to visualize the output of the Lippmannian device as a network with Gephi.
 
Netvizz
Extracts various datasets from Facebook.
 
NetvizzToSentiStrength
Adds sentiment analysis to short texts via SentiStrength.
 
Open Calais
Discovers the most relevant words and phrases among a set of websites, within a text, or within an issue network. Based on Reuters Open Calais.
 
Ranked Deep Pages from Core Issue Crawler Network
Enter an Issuecrawler XML file and this script extracts all pages from the core network and ranks them by inlink count.
 
Raw Text to Tag Cloud Engine
Takes raw text, counts the words and returns an ordered, unordered or alphabetically ordered tag cloud.
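The word counting and the three orderings can be sketched as follows. This is an illustrative approximation rather than the tool's implementation: the tokenizing pattern, the function names, and the reading of "ordered" as "most frequent first" and "unordered" as "order of first appearance" are all assumptions.

```python
import re
from collections import Counter

def count_words(text):
    """Lowercase the text and count each word (letters, digits, apostrophes)."""
    return Counter(re.findall(r"[a-z0-9']+", text.lower()))

def tag_cloud(text, order="frequency"):
    """Return (word, count) pairs in the requested order."""
    counts = count_words(text)
    if order == "frequency":
        # Most frequent first; ties are broken alphabetically
        return sorted(counts.items(), key=lambda kv: (-kv[1], kv[0]))
    if order == "alphabetical":
        return sorted(counts.items())
    # Any other value: keep the order of first appearance in the text
    return list(counts.items())

for word, count in tag_cloud("the cat and the hat"):
    print(f"{word}:{count}")
```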
 
Rip Sentences
Rip text from a specified page and force line breaks between sentences.
 
Robots.txt Discovery
Display a site's robot exclusion policy.
 
Source Code Search
Loads a URL and searches for patterns in the page's source code.
 
TLD counts
Enter URLs, and count the top level domains.
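Counting top level domains amounts to taking the last label of each hostname. The sketch below uses only the Python standard library; the function name `count_tlds` and the example URLs are hypothetical, multi-part suffixes such as `co.uk` are counted by their final label only, and hosts given as bare IP addresses are not handled specially.

```python
from collections import Counter
from urllib.parse import urlparse

def count_tlds(urls):
    """Count the last hostname label (the top level domain) of each URL."""
    counts = Counter()
    for url in urls:
        # urlparse only recognizes the host when a scheme or "//" is present
        host = urlparse(url if "//" in url else "//" + url).hostname
        if host and "." in host:
            counts[host.rsplit(".", 1)[-1]] += 1
    return counts

# Hypothetical example URLs
urls = [
    "http://example.org/a",
    "example.org",
    "https://example.nl",
    "http://example.co.uk/x",
]
for tld, n in count_tlds(urls).most_common():
    print(tld, n)
```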
 
Table to Net
Extract a network from a table. Set a column for nodes and a column for edges. It deals with multiple items per cell. (by Médialab Sciences-Po)
 
Tag Cloud Combinator
Enter two or more tag clouds and the values of each tag will be summed.
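Summing tag values across clouds can be sketched with `collections.Counter`. This is a minimal illustration assuming each tag cloud is a tag-to-number mapping; the function name `combine_tag_clouds` and the sample tags are hypothetical.

```python
from collections import Counter

def combine_tag_clouds(*clouds):
    """Sum the values of identical tags across any number of tag clouds."""
    total = Counter()
    for cloud in clouds:
        total.update(cloud)  # Counter.update adds to existing counts
    return dict(total)

# Hypothetical tag clouds: tag -> value
cloud_1 = {"climate": 2, "energy": 1}
cloud_2 = {"climate": 3, "water": 4}
print(combine_tag_clouds(cloud_1, cloud_2))
```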
 
Tag Cloud Generator
Input tags and values to produce a tag cloud. Output is in SVG.
 
Tag Cloud HTML Generator
Input tags and values in Wordle format to produce an HTML tag cloud or tag list.
 
Tag Cloud To Wordle
This tool allows one to transform a normal tag cloud into a fancy Wordle one.
 
Text Ripper
Rip all non-HTML content (i.e. text) from a specified page.
 
Timestamp Ripper
Rips and displays a web page's last modification date (using the page's HTML header). Beware of dynamically generated pages, where the date stamps will be the time of retrieval.
 
Tracker Tracker
Tracker Tracker detects web technologies, such as social buttons and trackers, from a predefined set of 'fingerprints', and outputs a list of the technologies used.
 
Tree Map Generator
Input tags and values to produce a Tree Map. Output is in SVG.
 
Triangulation
Enter two or more lists of URLs or other items to discover commonalities among them. Possible visualizations include a Venn Diagram.
 
Twitter Capture and Analysis Toolset (DMI-TCAT)
Captures tweets and allows for multiple analyses (hashtags, mentions, users, search, ...)
 
Wikipedia Cross-Lingual Image Analysis
Makes the images of all language versions of a Wikipedia article comparable.
 
Wikipedia Edits Scraper and IP Localizer
Scrapes Wikipedia history and geolocates the IP addresses of anonymous edits.
 
Wikipedia Entry Check
This tool checks whether each issue exists as a Wikipedia page, i.e., an article. If the page exists, it checks whether the organization is mentioned on it.
 
Wikipedia History Flow Companion
This script allows you to specify the range of Wikipedia revisions for use with IBM's History Flow visualization. Normally, History Flow only allows you to input a maximum of 100 edits.
 
Wikipedia TOC Scraper
Scrape the table of contents for revisions of a Wikipedia page and explore the results by moving a slider to browse across chronologically ordered TOCs.
 
 
Topic revision: r12 - 26 Jun 2013 - 19:25:39 - Erik Borra