Open Google and set Google Preferences to: 'Number of results: display 100 results per page'. Save preferences.

Query for Scroll down to the bottom of the results page. To check the number of available results, click results page 10. Here one notes whether there are 1000 results available (the maximum served by Google Web search) or fewer.

Copy and paste all 10 pages of results in a file and save the file. Input the text in the Harvester. Select 'Only return hosts' and 'Only return uniques.' Launch Harvester. Copy paste the result in a file and clean up the list for URLs that are aberrant, i.e. URLs that are not For a more exhaustive list, please also query Google for and just Repeat data cleaning procedure as above, where duplicates and non-Iraqi URLs are removed.
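The harvesting and cleaning step above can be approximated in a few lines of Python. This is only a minimal sketch, not the Harvester itself: the function names `harvest_hosts` and `keep_hosts_with_suffix` are hypothetical, the sketch assumes the pasted results contain full URLs beginning with http:// or https://, and the domain suffix used for cleaning is left as a parameter that the analyst supplies.

```python
import re
from urllib.parse import urlsplit

# Assumption: URLs in the pasted results start with http:// or https://.
URL_PATTERN = re.compile(r"https?://[^\s\"'<>)\]]+", re.IGNORECASE)


def harvest_hosts(text, only_uniques=True):
    """Return the hosts of all URLs found in text, in order of appearance.

    Mirrors the 'Only return hosts' and 'Only return uniques' options.
    """
    hosts = []
    seen = set()
    for match in URL_PATTERN.finditer(text):
        # Drop punctuation that often trails a URL in running text.
        url = match.group(0).rstrip(".,;:")
        try:
            host = urlsplit(url).hostname  # hostname is lowercased
        except ValueError:
            continue  # malformed URL, skip it
        if not host:
            continue
        if only_uniques and host in seen:
            continue
        seen.add(host)
        hosts.append(host)
    return hosts


def keep_hosts_with_suffix(hosts, suffix):
    """Keep only hosts that equal the suffix or are subdomains of it."""
    suffix = suffix.lower().lstrip(".")
    return [h for h in hosts if h == suffix or h.endswith("." + suffix)]
```

Calling `harvest_hosts(pasted_text)` and then `keep_hosts_with_suffix(hosts, suffix)` with the suffix of the sites under study removes duplicates and out-of-scope hosts in one pass. The cleaned list should still be checked by hand for aberrant entries.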

Enter the URL list in the timestamp ripper and launch the tool.
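The page does not describe how the timestamp ripper obtains its dates. One plausible approach, sketched here purely as an assumption, is to read the HTTP Last-Modified header that a server may send for a page. The function names below are hypothetical. Dynamically generated pages often omit this header or report the time of the request, so a missing or very recent value should not be over-interpreted.

```python
from datetime import datetime, timezone
from email.utils import parsedate_to_datetime
from urllib.request import Request, urlopen


def parse_last_modified(header_value):
    """Parse a Last-Modified header value into a UTC datetime, or None."""
    if not header_value:
        return None
    try:
        parsed = parsedate_to_datetime(header_value)
    except (TypeError, ValueError):
        return None  # header present but not a valid HTTP date
    if parsed.tzinfo is None:
        # Treat dates without zone information as UTC.
        parsed = parsed.replace(tzinfo=timezone.utc)
    return parsed.astimezone(timezone.utc)


def fetch_last_modified(url, timeout=10):
    """Send a HEAD request and return the page timestamp, or None."""
    request = Request(url, method="HEAD")
    with urlopen(request, timeout=timeout) as response:
        return parse_last_modified(response.headers.get("Last-Modified"))


def age_in_days(modified, now):
    """Whole days elapsed between the page timestamp and a reference time."""
    return (now - modified).days
```

Looping `fetch_last_modified` over the cleaned URL list and passing each result to `age_in_days` with a fixed reference time gives a simple table of page ages per site.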


One notes the different levels of attention paid to the governmental sites. Whereas certain sites are 'fresh', most likely through the use of dynamically generated content, other Iraqi governmental sites are stale, with page timestamps up to a year and a half old.

Take care also to visit the front pages of the Websites under analysis and check their status. In the example of the Iraqi Web, the oldest homepage (the North Oil Company) is a splash page, where the user chooses English or Arabic. The English and Arabic front-pages have these URLs, respectively: and Entering those two pages into the timestamp ripper shows that the Arabic page is fresher (4 November 2008) than the English (24 September 2008), and that overall both pages are 1-2 months old.
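The comparison of the two front pages can be reproduced with simple date arithmetic. The sketch below uses only the two dates reported above. The labels and the function name `rank_by_freshness` are illustrative, not part of the original tool.

```python
from datetime import date


def rank_by_freshness(timestamps):
    """Return (label, date) pairs sorted from freshest to stalest."""
    return sorted(timestamps.items(), key=lambda item: item[1], reverse=True)


# Dates as reported by the timestamp ripper in the example above.
front_pages = {
    "English front page": date(2008, 9, 24),
    "Arabic front page": date(2008, 11, 4),
}

ranking = rank_by_freshness(front_pages)
freshest_label, freshest_date = ranking[0]
stalest_label, stalest_date = ranking[-1]

# Number of days by which the freshest page is newer than the stalest one.
gap_in_days = (freshest_date - stalest_date).days
```

The same ranking applied to the full list of sites makes it easy to separate the fresh sites from the stale ones.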

Topic revision: r2 - 21 Dec 2008, RichardRogers