Thread: [SMW-devel] relations and templates


[Discussion moved here from talk pages...]

Here are a few more thoughts on why templates seem, to me, incredibly
important to this effort.

- Redundancy is always dangerous. All the same arguments against
separate coding of relations outside of the normal article text also
apply to any scheme that requires an extra level of annotation in
either a template or its inputs to add semantic significance. I
suggest that the scheme will be qualitatively more reliable if
templates are defined to have inherent semantic significance. This
might ultimately involve tweaking the template system somehow before
it makes total sense, but the current state of templates seems like a
workable approximation to me.

- Semantic annotations are individually worthless. That is, linking
topic A to topic B isn't interesting in itself. What's interesting is
defining the relationship between things like topic A and things like
topic B, and then linking *all* of the things like topic A to all of
their corresponding things like topic B. You have to approach complete
coverage before your query results become useful. Thus the human work
of defining the relationship schema is central to the whole semantic
process. Templates (particularly infoboxes) are MediaWiki's closest
approach to microformats at the moment, and thus the center of
discussion about, effectively, the common schema of various data
types. Anything we can do so that work done for
input/display/consistency purposes is also, by definition, semantic
progress will be strongly to our advantage.

- Although it's obviously possible to build small demos on sample data
(or to use non-public data from other installations of MediaWiki), the
big oohs needed to catapult this project into the MediaWiki mainstream
will only come from being able to execute a new semantic query against
the live Wikipedia to answer some question that up until now could not
have been answered by machine. If we can repurpose template input as
semantic data, we have some hope of doing a genuinely interesting
query on current real data, which seems vastly preferable to having to
go back and hand-code some huge number of relationships. And this
applies even more so, obviously, to future Wikipedia data. If semantic
coding is a separate step, it will be done erratically, and all it
takes is a tiny amount of data incompleteness to render semantic query
results effectively meaningless.
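
To make "repurpose template input as semantic data" a bit more
concrete, here is a rough Python sketch of the idea. It is not SMW
code and not a proposal for the real implementation. The function name
template_to_triples and the sample infobox are made up for
illustration, and it assumes a flat template call with named
parameters only (no nested templates, no piped links).

```python
import re


def template_to_triples(page_title, wikitext, template_name):
    """Turn one template call into (subject, property, value) triples.

    Hypothetical helper for illustration only. Assumes a flat call
    with named parameters; nested templates and piped links such as
    [[Paris|the capital]] are not handled.
    """
    # Capture everything between the first "|" and the first "}}".
    pattern = r"\{\{\s*" + re.escape(template_name) + r"\s*\|(.*?)\}\}"
    match = re.search(pattern, wikitext, re.DOTALL)
    if match is None:
        return []

    triples = []
    for part in match.group(1).split("|"):
        if "=" not in part:
            continue  # skip positional (unnamed) parameters
        name, value = part.split("=", 1)
        # Strip link brackets so [[Paris]] becomes Paris.
        value = value.strip().replace("[[", "").replace("]]", "")
        if value:
            triples.append((page_title, name.strip(), value))
    return triples


# Made-up article text; the parameter names are assumptions.
sample = """{{Infobox country
| name = France
| capital = [[Paris]]
| currency = [[Euro]]
}}
France is a country in Europe."""

triples = template_to_triples("France", sample, "Infobox country")
for triple in triples:
    print(triple)
```

The point of the sketch is that each parameter name already behaves
like a relation name, so no extra annotation step is needed on the
article side.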

glenn
