Click here to receive your FREE subscription to Campus Technology
Home > Semantic Search: Could the Web Think?
Opinion
Semantic Search: Could the Web Think?
7/16/2008
By Trent Batson
One way to improve search, then, is to start organizing information better to begin with. If you were searching a library where books had just been dumped willy-nilly (as have resources on the Web), you'd also have difficulty finding the right printed information. One current approach to better organization up-front is through the Resource Definition Framework (RDF):
"The RDF metadata model is based upon the idea of making statements about Web resources in the form of subject-predicate-object expressions, called
triples in RDF terminology. The subject denotes the resource, and the predicate denotes traits or aspects of the resource and expresses a relationship between the subject and the object. For example, one way to represent the notion "The sky has the color blue" in RDF is as the triple: a subject denoting "the sky", a predicate denoting "has the color", and an object denoting "blue"." --Wikipedia
In other words, we're using natural language (ordered in this case on the syntax of Western languages) as a model for our standard descriptors of Web information.
Now, we have a way to describe individual resources. These descriptions define the resources, so search can produce more relevant results. But what about resources that are related semantically but don't have the full set of descriptors in the search?
If we are basing our search to find a web of related resources, we need those resources to also have definitions about their relationship to other resources. We need a Web ontology. The Web Ontology Language has been produced by the WC3:
"The data described by an OWL ontology is interpreted as a set of "individuals" and a set of "property assertions" which relate these individuals to each other. An OWL ontology consists of a set of axioms which place constraints on sets of individuals (called "classes") and the types of relationships permitted between them. These axioms provide semantics by allowing systems to infer additional information based on the data explicitly provided." --Wikipedia
But, Does it Work?Machines should now be capable of using a new Web language to talk to "individuals" who have properties. And, then these individuals (or the database in which they reside) will lead people to semantically related other individuals (sets of data). This improved search targets content that is more reliably relevant than current searches produce, and then the content is placed within a context of meaningfully related other content.
The question right now is when will enough organizations ontologize their resources so that a true semantic search will be possible? I used Hakia,
http://www.hakia.com/, "a new semantic search engine," to do the same search about gardening with no better results than Google. (Google already includes some semantic elements in its algorithm, however, which probably made its results somewhat closer to Hakia's.)
There is hope, however, and maybe a hint of a trend. A large number of major corporations, and other large organizations, are in the process of semanticizing their Web holdings. See:
http://www.w3.org/2004/01/sws-testimonial.
And stay tuned. Artificial intelligence research, on which the goal of a Semantic Web is based, always seems to take longer to produce results than we thought. The Semantic Web is not a reality yet. When it is a reality, will it be able to "think"? Not really, but I hope it can at least convince me that gardening is more work than it's worth.
Trent Batson, Ph.D. has served as an English professor, director of academic computing, and has been an IT leader since the mid-1980s. He is currently Co-Lead for the Web2ePortfolio Initiatve (W2eP), a Senior Associate with the TLT Group, and Editor of Campus Technology's Web 2.0 e-newsletter. batsontr@mit.edu
Cite this Site
Trent Batson, "Semantic Search: Could the Web Think?," Campus Technology, 7/16/2008, http://www.campustechnology.com/article.aspx?aid=65418
copy text (above) for proper citation