You are here: Foswiki>Dmi Web>DmiTools>HowTo (17 Oct 2008, AnneHelmond)Edit Attach

HOWTO

IC URL Extractor

Description

Extracts URLs from an Issuecrawler .xml file

Links

tool: ToolExtractUrls

results: on screen

Input

isscuecrawler xml source file

Output

list of URLs

Use

Get link location of issuecrawler xml source file (see Issuecrawler 1.2)

Select either 'Full URLs' or 'Only host' as output

Submit will return a list of URLs devided into 'Startingpoints', 'URLs or Hosts in Network' and 'All URLs and Hosts, excluding Startingpoints'

Issue Geographer

Description

Tool to geolocate URLs

Links

tool: ToolIssueGeographer

result log:on screen


Input -Issuecrawler xml source file
-List of URLs

Output

svg map with geolocated and sized flags

Use

Choose to browse for and xml file or input URLs directly or start from an empty list

Fill in your email (xml input) or title of the map (urls input)

Empty list will get you straight into the 'edit and view' section

Previous generated maps are show at the bottom of the screen

You'll get a notification of your request stating an email will be send when the file is done

Opening up the file brings you in the 'edit and view' section

For each URL it retreives title, location, region, country, lat, lon, visible, type and action.

In this field you can manually edit these fields or fill in missing data

Choose show to show the map or save to first save it locally

The map will show flags on the locations of the hosting, sized by number

Actor Profiler

Description

The script will calculate the top 10 nodes out of the selected issuecrawler network map by indegree, query googlenews for the issue you specify, get the pagerank for those top 10 nodes and returns them and the actor profile graphic for those results. The colors for the actor profile are taken from the svg.

Links

tool: ToolActorProfiler

results: http://tools.issuecrawler.net/beta/results/actorProfiler/

relating tool: ToolIssueCrawler

Input

-issue keyword

-issuecrawler network map

Output

-pagerank for top 10 nodes -actor profile graphics in svg

Use

1. Select an issue

2. Browse for your issuecrawler network map

3. Click profile to see the results

Tag Cloud Generator (counting)

Description

Takes and counts raw text or a Google result and returns an ordered, unordered or alphabetically ordered tagcloud.

Links

tool: ToolRawTextToTagCloud

results: http://tools.issuecrawler.net/beta/results/svg

Relating tool: ToolGoogleScraper

Input

-Raw text or a Scrape Google result list url

Output

html, svg and pdf

Use
  1. For 'titel' input a unique name for the tagcloud
  2. For 'method', describe how the input data has been gathered. This could copied Google search returns
  3. In 'query', specify the query used to retrieve the data like a Google search quer
  4. In the input field, specify the Scrape Google url result list or enter raw text into the textfield area
  5. Select a constrain to either minimize the number of charaters or the number of occurences for each keyword in the cloud

PDF/SVG Tag Cloud Generator

Description

Input tags and values to produce an ordered, unordered or alphabetically ordered tagcloud in PDF and SVG that can edited for print.

Links

Tool: ToolTagCloudGenerator

Results: http://tools.issuecrawler.net/beta/results/svg

Related tool: ToolRawTextToTagCloud Tag Cloud Generator (counting)

Input
  • previous results from the Tagcloud counter
  • a tag cloud in the following format: tag (4) word (2) bla (17)

Output

PDF and SVG

Use
  1. The PDF and SVG contain a legenda with a Title, Method and Query description.
  2. For 'Title' input a unique name for the tagcloud
  3. For 'Method', describe how the input data has been gathered. This could copied output from the related tools: Tag Cloud Generator (counting) or the Google Scraper.
  4. In 'Query', specify the query used to retrieve the data like a Google search query (if applicable)

Issue Discovery Tool

Description

Discovers issues from a dataset by analysing and counting keywords. These keywords are found by doing basic language analysis.

Links

tool: ToolIssueDiscovery

results: on screen

related tool: ToolIssueCrawler
Input

-xml, a list of urls or raw text

Output

A list of couted keywords by url

Use

Select the desired input and submit

Description

Use Yahoo API to discover the incoming links to one or several urls.

Links

tool: ToolYahooInlinks

results:

related tool:


Input

-url's

-Name for result file

Output

Count and list of in-links per url

Use

Input one or more url's (one per line), specify a result file name and submit

Compare List Tool

Description

OPtional clean-up input lists and compare them for commonalities and differences.

Links

tool: ToolCompareLists

results: on screen

Input

-two list of urls

-selection on clean-up actions

Output

Comparison between two list of urls. Output depends on selected actions

Use

  1. Input two list of urls (one per line).
  2. Select the preffered action
  3. Compare lists
Topic revision: r4 - 17 Oct 2008, AnneHelmond
This site is powered by FoswikiCopyright © by the contributing authors. All material on this collaboration platform is the property of the contributing authors.
Ideas, requests, problems regarding Foswiki? Send feedback