Organix: Creating Organic Objects from Document Feature Vectors

Organix: Creating Organic Objects from Document Feature Vectors

Robert J. Hendley, Barry Wilkins, Russell Beale
DOI: 10.4018/jcicg.2010010104
OnDemand:
(Individual Articles)
Available
$37.50
No Current Special Offers
TOTAL SAVINGS: $37.50

Abstract

This article presents a mechanism for generating visually appealing but also effective representations for document visualisation. The mechanism is based on an organic growth model which is driven by features of the object to be visualised. In the examples used, the authors focus on the visualisation of text documents, but the methods are readily transferable to other domains. They are also scaleable to documents of any size.The objective of this research is to build visual representations that enable the human visual system to efficiently and effectively recognise documents without the need for higher level cognitive processing. In particular, the authors want the user to be able to recognise similarities within sets of documents and to be able to easily discriminate between dissimilar objects.
Article Preview
Top

A wide variety of document collection visualisations have been developed. Bead (Chalmers & Chiston, 1992) uses physically based modelling techniques to produce document clusters. This approach can be computationally complex, an alternative and more efficient algorithm has been developed (Chalmers, 1996). WEBSOM as described by Lagus et al. (1996) uses a self organising map (SOM) algorithm to produce a map of documents with similar documents located in closely related regions of the map. Themescapes (Wise et al., 1995) visualises the thematic content of a document collection as a 3D landscape, stronger themes are give a higher elevation. A network is used by Singhal and Salton (1995), Salton (1995), the resulting structure of the network and the number of incident lines (or degree) at a particular node can give insights into the core documents or paragraphs within a particular article. The research and approaches used for text visualisation are extensive. Card et al. (1999, p409-461) contains a selection of papers discussing 1D, 2D and 3D text visualisation. A comprehensive review of document visualisation has been written by Morse (1998).

A novel system is described by Roher et al. (1998). This approach generates a document feature vector, maps the weights for each feature to distances along each axis and the eight bisecting quadrants, place spheres at the end points and finally produce a 3D amorphous shape. This allows up to 14 dimensions of the document to be viewed as a single shape. Documents can then be compared, with similar documents having similar shapes. It was this idea that inspired the current work.

Complete Article List

Search this Journal:
Reset
Volume 14: 1 Issue (2024): Forthcoming, Available for Pre-Order
Volume 13: 2 Issues (2022): 1 Released, 1 Forthcoming
Volume 12: 2 Issues (2021)
Volume 11: 2 Issues (2020)
Volume 10: 2 Issues (2019)
Volume 9: 2 Issues (2018)
Volume 8: 2 Issues (2017)
Volume 7: 2 Issues (2016)
Volume 6: 2 Issues (2015)
Volume 5: 2 Issues (2014)
Volume 4: 2 Issues (2013)
Volume 3: 2 Issues (2012)
Volume 2: 2 Issues (2011)
Volume 1: 2 Issues (2010)
View Complete Journal Contents Listing