



Ask Our Expert: July 2006



|

|
What is an XML lexicon?
-
Lexicons come in two flavors: word lexicons and value lexicons.
Word lexicons are dictionaries of the word tokens used throughout the database. Word lexicons can span the entire content of the database, or can be constrained to the content found within a particular element or element-attribute pair.
Value lexicons are dictionaries of specific sets of values used across the entire database. Because values relate to QNames, specific value lexicons relate to specific elements or element-attributes.
In many ways, lexicons are a great demonstration of how a query language can make 1+1=10. Lexicons themselves are deceptively simple. After all, what could be easier than a list of words or values? But being able to programmatically access these dictionaries from the middle of an XQuery module unleashes a powerful combinatoric effect: because it's easy to combine lexicon access with the other features of MarkLogic Server, you end up with an endless variety of use cases.
Read more about XML Lexicons at xq:zone
http://developer.marklogic.com/columns/smallchanges/2006-06-23.xqy
What is an XML classifier?
-
XML classification lets you classify at any level of granularity - from the individual element to the entire document. It also makes classifications more precise by taking account of content, structure and the combination of content and structure. When you stop to think about it, it's not rocket science to realize that "heart attack" in a title should be treated differently from "heart attack" in a footnote when you're comparing two documents. And yet, most classifiers can't tell the difference. Our XML classification engine is state-of-the-art in every respect, and that has big ramifications for your content applications.
|

|
|