Posts filed under 'categorisation'

Semantic Analysis Technology

I attended “Semantic Analysis Technology: in Search of Categories, Concepts & Context“, the fourth ISKO UK KOnnecting KOmmunities event on 3 November 2008 at University College London.

First up were presentations from two vendors, Luca Scagliarini and Jeremy Bentley.

Scagliarini argued that information discovery suffers from information overload and information underload due to a lack of meaning-based text processing. Free text search and shallow automatic linguistic analysis did not do the job, but a ‘deep semantic analysis’ based on the analysis of relationships and ‘understanding’ the meaning that is encoded in the relationships between verbs, prepositions and nouns demonstrates potential.

Bentley reviewed key information organisation issues – unstructured information, the doubling of number of resources every 19 months, ‘findability’ problems and and the how black box solutions may not do the job. He discussed the relevance of metadata and taxonomies built specifically to reflect the way an organisation workss.

Later, practitioners presented – Rob Lee and Helen Lippell, Karen Loasby and Silver Oliver.

Lee talked about Muddy Boots, a BBC project to support the BBC’s remit to link to more external sources. Lee illustrated how structured datasets in the public domain could be used to contextualise and index BBC resources and exploit the semantic richness to link to find meaningful external links.

Lippell, Loasby and Oliver discussed three different implementations of auto-categorisation systems, demonstrating advantages and issues with each approach. The approaches were:

  • using Verity Intelligent Classifier (VIC) and a taxonomy with a set of rules that could be finely tuned
  • applying a rule-based automatic classification system combined with the author’s review and corrections to produce BBC content that could be described in detail. The approach
  • a “statistical-based auto-categorisation” project designed to connect and cross-reference distributed BBC content and resources horizontally

Add comment 8 November 2008

McGovern makes arguments I support

In recent blogs, Gerry McGovern makes arguments close to my way of thinking.

  • Choosing the right classification words
    argues for the need to ensure content creators use the words that your audience use rather than corporate/PR/government language. Of course, in many cases the “official” language has to be used, but it’s important to optimise content for the non-official terms that your audience actually uses.
  • Obsessed by technology reminds us that technology isn’t the total solution to content management. However good the tools, there needs to be editorial/business input.

Add comment 25 October 2008

Semantic Analysis Technology

Former and current colleagues speaking at Semantic Analysis Technology: in search of categories, concepts & context, an ISKO UK event at UCL on 3 November 08.

Tales from the trenches of auto-categorisation: three case studies in the implementation of auto-categorisation systems

This session will look at three different implementations of auto-categorisation systems in large media organisations. We will look at common themes that have recurred time and time again. The focus will be on ‘do you really need it?’, managing expectations and tips based on the combined experience of the last 5 years. Helen Lippell, Karen Loasby and Silver Oliver are all Information Architects who have been involved in the management of controlled vocabularies and the systems that are used to automatically apply them..This session will look at three different implementations of auto-categorisation systems in large media organisations. We will look at common themes that have recurred time and time again. The focus will be on ‘do you really need it?’, managing expectations and tips based on the combined experience of the last 5 years. Helen Lippell, Karen Loasby and Silver Oliver are all Information Architects who have been involved in the management of controlled vocabularies and the systems that are used to automatically apply them..

Add comment 4 October 2008


Categories

 

December 2009
M T W T F S S
« May    
 123456
78910111213
14151617181920
21222324252627
28293031  

Blogroll

Tags

Act on CO2 Adsense Advanced search and communications conversation crowd sourcing culture customisation design Directgov eye tracking google government Hitwise information architecture internet world ISKO keywords links long tail Obama ontology or personalisation pse RDFa redundancy RSS search search behaviour search engines semantic web serp snippets target position titles universal search URLs useit wiki Yahoo!