Thread: [SMW-devel] Review of Semantic MediaWiki 0.3

SourceForge Headquarters 225 Broadway Suite 1600 San Diego, CA 92101 +1 (858) 454-5900

Hello,

Matthias Schindler has talked me into doing a short review of the Semantic
MediaWiki extension. The idea of it looks very interesting. It would make
many lists obsolete and would allow a kind of data minig we currently don't
provide. And I'd love to have well structured geographic (meta-)data
available to draw maps.

The current software is Version 0.3, so it's still far from production
readiness. Deficits I've found, in mostly chronological order:

* The installation of the SMW tables is triggered from a special page.
The database user that MediaWiki uses at that point is not allowed to
create tables if the user was created by the MediaWiki installer. A hook
for extensions in the installer could be helpful here.

* The tables are created as type=MyISAM. That's a no-go for sites like
wikipedia.org. MyISAM provides only very poor locking and is only
suitable for read-only databases.

* No indexes are created. All requests require full table scans.
The choice of 'text' as datatype for page titles ('subject') in all
SMW tables makes this even worse - it prevents the creation of proper
indices.

* References to the page table are given as (namespace,title), not using
page_id. While this is nice for querying SMW tables (no join needed),
this results in a potential risk when renaming pages (high write load).

* Properties (like e.g. population, geographic coordinates, birth dates)
are stored as strings. This makes queries likes 'all cities with more
than a million inhabitants' very expensive. Another no-go for a site like
wikipedia.org. Those queries will happen frequently.

* Non-standard way to handle local settings. Should be incorporated into
global LocalSettings.php. Many of these settings should be
auto-detected, esp. smwgServer, SMW_ScriptPath and SMW_IP 

* Naming conventions for variables: SMW_RAPPath, enableTemplateSupport
and glNamespacesWithSemanticLinks are examples for three different
ways to name variables.

* Poor use of the database abstraction layer. If you'd use the
abstraction layer, you wouldn't need to use globals for the table names
and you wouldn't have to quote all fields on your own. You'd also
benefit from future extensions, like query optimization or query
distribution.

* There shouldn't be an editing help link on article pages, only show it
on edit pages.

* I don't like how the relations and attributes are displayed. But I've
no idea yet how to improve it.

For some of these items I've created bug reports on sourceforge, some
also include patches.

Regards,

	jens

Thread: [SMW-devel] Review of Semantic MediaWiki 0.3

Lets you store and query data within the wiki's pages.

semediawiki-devel