From our friends in Southhampton (correction: and Hasso-Platner), a study of how to differentiate experts (who really know how to tag stuff) from spammers (who want to tag their own stuff, but try to acquire credibility by copying tags others have used). They try to exploit the difference that the people who tag first are [...]
Mark Carman presented on the paper in the title. They’re interested in studying personalization, but for that they need personalized relevance judgements. Query logs are a great source of that information but aren’t available due to privacy concerns. So they started looking at whether tag data (public) could be used as a substitute for query [...]
Microformats and RDFa provide a way to interweave semantic markup within a web document so that structured information can be more easily extracted. Both Microformats and RDFa follow the hierarchical model of HTML: structured data to be extracted may exist spread across several layers of the DOM
hierarchy. A pseudocode example of this is below, where we see that [...]
A CIDR presentation by Ken Smith from Mitre on the use of the “match” operation that pairs properties of two different schema. It’s used to merge data from two different sources. He’s arguing that there are tons of uses of schema matching that precede the actually merging of data.
When you are trying to decide whether [...]
This post about ISWC got lost in the drafts pile, so I’m publishing it a bit late.
The folks from Tagora spoke about the relationships between different tags that get applied to the same document, and the tags that different people apply to the same documents. The looked at the ternary relation (user, resource, tag) and [...]
I’m composing the first of what I hope will be many posts describing information tasks that should be easy (ideally, point-and-click easy) but instead seem to be very hard (requiring programming). I post these in the hope of
finding out about tools that do make these tasks easy, or
fomenting thought about why these tasks are [...]
This post was going to be an email to a project mate in my database class, but I found myself wanting to markup the text in HTML so I figured I’d put it here.
Unlike web specifications like HTML, which have only one representation on paper/disk, RDF is just an abstract model — it doesn’t matter [...]
What One Word Describes Your Current State of Mind?
Visitors are asked to “Enter the word that best describes your current mood. You can submit a response every hour. This page will update with the most popular choices from NYTimes.com readers.”
Here are a few projects that won awards at ISWC (Just a sampling of the one’s that most caught my eye. I’ve already lost track of which won what):
Michiel Hildebrand won for his work on effective interfaces for exploring “cultural heritage” (art and history) collections.
Sofia Angeletou won for her work on “semantically enriching folksonomies”—using [...]
They ask, what use are these online profiles of interests? One application would be personalization. They observe that individual’s tag clouds often suggest what they are interested in. They gather and correlate tags from a user’s many different accounts (they find them by you typing in your home page url; this is generally associated with [...]