• Link to LinkedIn
  • Link to X
  • Link to Facebook
  • Link to Youtube
  • Link to Mail
Web Intelligence and Visual Analytics
webLyzard technology
  • Home
  • Solutions
    • Product Portfolio
    • Technology Showcases
  • Platform
    • Dashboard Overview
    • Visualization Tools
    • Data Services
  • Research
    • Research Projects
    • Horizon Europe Funding
    • Horizon Europe Dissemination
    • Publications
  • News
    • Latest Updates
    • Release History
    • Newsletter
  • About
    • Contact Details
    • Partners and Clients
    • Privacy Policy
  • Menu Menu
Cluster Map Thumbnail

Cluster Map – Search Related Documents

What is a Cluster Map?

Search queries often return an overwhelming number of online documents. A cluster map is an intuitive way to group these search results by topic. By identifying similar documents, it helps to better understand the structure of online coverage and other large document collections. The visual representation of the cluster map arranges documents by their semantic similarity. The system then assigns each document to a specific cluster, which acts as a local gravity point. As a result, the largest node rests at the center and attracts other nodes that belong to this cluster.

Cluster Map Layout

To achieve an appealing and easy to read display, the visualization combines clustering methods with a force-directed layout algorithm. It then highlights groups of similar documents by a convex hull shape that visually holds its nodes together. The size of this shape is dynamic and depends on the number of contained nodes. Each of the nodes, variable in size and color, represents one document:

  • Node color reflect the selected metadata attributes. This can be the result of text classification, for example, or extracted affective knowledge such as document sentiment – ranging from red (negative) to grey (neutral) and green (positive). The saturation depends on the degree of polarity. Vivid colors indicate emotional articles, lower saturation a more factual coverage
  • Node size is an optional feature of the cluster map. It can indicate the reach of the document’s source, for example. In this case, a node representing CNN.com article is rather large compared to an article of a local community site.

Single Cluster with Sentiment Color Coding

Three keyword labels per cluster describe its contents. Initially, the system computes these labels based on the document keywords within the cluster. This process considers the reach of the documents’ sources to optimize the selection. Reducing the opacity of nodes and their hull shapes increases the label’s readability and reduces the overall visual load.

Interactive Cluster Map Features

  • Hovering over a document cluster hides its keywords and highlights its shape and nodes through higher opacity. The colors of its nodes become more vivid.
  • Clicking on a cluster triggers a new search, narrowing down the set of results to documents within the selected cluster.
  • Hovering over a single node highlights this node with an orange stroke. Additionally, a tooltip shows document keywords and the favicon of the source.

Cluster Map based on Document Similarity - Search Term: Biodiversity
Cluster Map based on the search term biodiversity, color-coded by sub-topic: climate change, rainforest, wildlife and ocean

Clustering Process

Keyword clustering tools have to balance accuracy and scalability. Common methods to build such tools include the Louvain method for community detection as well as K-means, which divides the collection of documents into a fixed amount of clusters. Each document belongs to the cluster with the nearest centroid. The story detection algorithm of webLyzard pursues a similar approach. It uses time slices to extract keywords and is particularly suited for the real-time clustering of very large document collections.

References

  1. Jain, A.K. (2010). Data Clustering: 50 Years Beyond K-means, Pattern Recognition Letters, 31(8): 651-666.
  2. Syed, K.A.A., Kröll, M., Sabol, V., Scharl, A. and Gindl, S. (2012). Incremental and Scalable Computation of Dynamic Topography Information Landscapes, Journal of Multimedia Processing and Technologies, 3(1): 49-65.
https://www.weblyzard.com/data/sites/21/cluster-map.png 280 280 Arno Scharl https://www.weblyzard.com/data/sites/21/weblyzard-logo-2020.png Arno Scharl2014-03-23 19:29:202025-01-06 06:17:36Cluster Map – Search Related Documents
Search Search

CATEGORIES

  • News & Events
  • Use Cases
  • Data Services
  • Visualizations
  • Research Projects

Recent Updates

  • AI Visibility Tracking – Monitoring Generative Engine ResultsJanuary 19, 2026 - 5:14 am
  • TRANSMIXR Presentation at IBC 2025 - Newsroom AI Toolbox
    Newsroom of the Future at IBC 2025September 29, 2025 - 9:42 pm
  • Sustainability Reporting with Generative AIJuly 20, 2025 - 11:59 am
  • CLAIM Project - Thumbnail
    Hybrid AI Models to Detect DisinformationApril 21, 2025 - 9:22 pm
  • Generative AI (GenAI) Thumbnail
    Generative AI for Content LifecyclesMarch 18, 2025 - 8:22 pm

About

webLyzard technology is an Austrian SME founded in 2008. The unique capabilities of its big data platform are based on a strong R&D track record in the fields of knowledge extraction, artificial intelligence, visualization and the integration of geospatial and semantic Web technologies.

web·Lyz·ard

Function: intelligence platform; Etymology: composed from web (as in World Wide Web) and lyzard (as in analyzer). 1 : (broadly) enriches digital content; identified by its speed, accuracy and scalability. 2 : predicts trends to gain a deeper understanding of information flows.

Visual Tools

  • Trend Chart Thumbnail
    Trend Chart – Dynamic Content MetricsOctober 18, 2020 - 9:00 am
  • Story Graph / Streamgraph Thumbnail
    Story Detection and Story Graph VisualizationApril 10, 2020 - 9:02 am
  • Geographic Map of Europe
    Geographic Map – Geospatial AnalyticsOctober 18, 2019 - 11:15 am

Data Services

  • AI Visibility Tracking – Monitoring Generative Engine ResultsJanuary 19, 2026 - 5:14 am
  • Wildcard Search and Regular Expressions
    Wildcard Search and Regular ExpressionsJanuary 9, 2025 - 4:46 am
  • Knowledge Graph - SKB - Thumbnail
    Knowledge Graph – Semantic Knowledge BaseNovember 28, 2024 - 10:00 am
Link to: Named Entity Recognition – Recognyze Link to: Named Entity Recognition – Recognyze Named Entity Recognition – Recognyzerecognyze logo - named entity recognitionLink to: Game of Thrones Stimmungsbarometer Link to: Game of Thrones Stimmungsbarometer Thumbnail of Game of Thrones - Westeros SentinelGame of Thrones Stimmungsbarometer
Scroll to top Scroll to top Scroll to top