Re: [Challengeos-developers] more ideas

SourceForge Headquarters 1320 Columbia Street Suite 310 San Diego, CA 92101 +1 (858) 422-6466

sou...@fu... wrote:
> Re: [Challengeos-developers] more ideas
> Von: Philipp Gühring <p.g...@fu...>
> An: cha...@li...
> Datum: Fri, 26 Jul 2002 19:03:55 +0200
> 
> Nachricht enthält Signatur von Philipp Michael Gühring (Sourcerer) 
> <pg...@pa...>
> Hi,
> 
>> This file system will be soleley for data storage.
> 
> 
> Applications could be another idea too. Look at www.entity.cx
> I think that could be the basis for your componentizes modulized 
> architecture
> too.
> 

I'd only do this for interpreted code like Perl or Python fragments 
(pretty much the same thing entity is doing). The reason is that I can't 
think about a way to make this behave correctly for binary code 
fragments. When should which piece of code be executed?

>> It will serve as some
>> sort of hierarchical database. It'll be designed in such a way that an
>> XML file can go through an import/export cycle on this file system and
>> will come out unaltered (maybe perhaps some whitespaces vanished or were
>> added, but that has not changed the contents of the file).
> 
> 
> Ok.
> 

I can think of optional features in DOMfs that can't be mapped directly 
into XML files that way like revision management or access rights. The 
former is a pure convenience function, which might turn out quite 
useful, but the latter is a must-have in the multi-user environment 
ChallengeOS will have to provide.

Currently I'm also pondering wether there should be the possibility to 
define a format for XML subtrees (like it is done with DTDs or similar 
files) and have DOMfs block any future changes that violate these 
definitions. This is strictly optional, even if it should become 
implemented. But is it worth implementing?

>> How is this file system accessed? Among the many ways thinkable I prefer
>> to map the hierarchical structure onto persistent data structures which
>> are accessed from memory. However, this can only be achieved in a
>> reasonable way when an object-oriented language is used. Actually, pure
>> procedural languages like C will have a hard time in ChallengeOS anyway
>> (see below). Evidently, you could design a plain procedural C-compatible
>> interface, but that'll be a lot more complicated to use.
> 
> 
> Yes.
> DOM, and all the necessary functions around it, have to be provided by the
> system kernel, for every kernel driver, for every programming language, and
> for every application.

I agree except for the kernel and device drivers. It's senseless to do 
some XML tricks in this area of an OS. Remember that device drivers are 
about providing unified interfaces to hardware. And for the sake of 
speed they must be as simple, flexible and straight-forward as possible. 
Device drivers will most likely be interfaced via device files, i.e. the 
good old /dev/ dir. This method is fast, clean and can be implemented by 
providing a small number of syscalls (open, read, write, close, ioctrl). 
You surely wouldn't want to "cat randomcode.xml > /dev/xxx" and expect 
it to do something useful with your input, would you?

> I see two necessary functions based on DOM, that should be made available:
> XPath and XSLT.

A question about XPath: Isn't there a supposedly more sophisticated 
successor in preparation?

For XSLT: I think that it would be a good thing to leave the 
implementation of this in user space. It should be available.

> For the programming languages: DOM should be a datatype:
> DOM mytree;

Yes. Something like that. Anything else wouldn't make sense in a C++ 
program. Maybe there should be a parameter passed to the constructor 
telling it what node to take as a root node.

> XPath should be as integrated into the programming languages as Regular
> expressions are integrated in Perl:
> foreach (mytree =~ x/DOCUMENT/NODE/SUBNODE[@attribute='value']/Text )
> {
> print;
> }

Although that syntax is fine with Perl, it should be covered by 
functions and classes in C++. Reason is that it would otherwise hevily 
break the already surprisingly complex C++ syntax (C++ syntax looks 
simple, but it certainly isn't).

> And the kernel should make the following possible:
> DOM persistentTree("/dev/hdd4");
> Which makes the DOMfs from the partition /dev/hdd4 available as DOM tree.
> 

Taking the device file name makes the node definition dependent on the 
physical disk layout in the machine, which is a bad thing to do. Instead 
I think there should be a way to mount existing DOMfs partitions into a 
virtual root node. This way data can be spread across many disks and 
partitions without requiring the application to care about this. So it 
would be similar to the directory tree on UNIX.

>> Note that there must still be conventional file system available because
>> at least the software must be stored on this file system.
> 
> 
> Hmmm. Have a look at Entity.
> I think Entity shows the way, how applications can be developed in the 
> future.
> 

As I already pointed out that I doubt that this could work for compiled 
code. I agree that it's a neat concept they have, though. However, I 
need some hints about how binary code could be included in this concept.

>> The
>> requirements of both file system types absolutely exclude one another.
>> So there is no way to combine them.
> 
> 
> ;-)
> I am not sure yet. I don't think that it's impossible, it just isn't
> straitforward.
> Did you take a deep enough look at IVI::DB yet?

No.

> IVI::DB is my own native XML database, which is available under the GPL 
> from
> http://www.livingxml.net/  (->Plattform ->Database ->bottom of the page)
> With IVI::DB, I somehow succeed to build up a XML database on top of a 
> normal
> hierarchical filesystem.
> 

And what is so special about this? I don't see the point here.

> 
>> 2. Enhanced execution environment
> 
> 
> (The name somehow reminds me of Palladium ... but forget that)
> 

Please sugest a better name. I find it awful too, because it's so long. 
And I'm definitely against the development of Palladium.

>> [Note: This is in my oppinion the most important feature. I'll not give
>> up this one. No way!]
> 
> 
> Did I said anything against it?
> Something like that stumbles through my mind for some years now, but I 
> am not
> yet sure, how I really want it, and I think I want it a bit differently 
> than
> you, but I think the overall direction is not that wrong ...
> 

Well, let's fight it out... ;)

>> This is more than a feature. It's a computing concept unmatched by
>> anything I've seen so far or that is to come in the near future. In
>> other words: it's unique!
> 
> 
> Marketing.
> 

Perhaps. But it's something every project does :).

>> It's hard to describe all of this exactly. Imagine your software
>> consisting of a whole wagonload of small, specialized modules (or
>> libraries) which are running in the same address space. Every module
>> could use functions provided by every other module that is installed.
>> Extending such an environment would be easy: just write the missing
>> module using features from the modules that are already available.
> 
> 
> Sounds like the shared library concept.

It's more than that. It's rather a component model. No module will ever 
have more than one instance (including code *and* data). Consider the 
following example:

Task 1 running module A accesses module B, which is not yet loaded. The 
OS traps this and fires up module B. Module A then forces module B to 
load some data - say a data file like "/tmp/somedata.tmp" from disk. 
Then another task - task 2 - which just got started up also references 
module B. This access is of course trapped. But it does not result in 
loading another instance of module B. Instead task 2 shares module B 
with task 1 from this moment on. This also means that it sees the 
current state of module B, in this case that it has loaded a data file.

In a nutshell, this is what the enhanced execution environment is about. 
Of course it wound't be suitable for a multiuser environment if it can't 
  perform access checks to ensure data and system security.

With a few tweaks to this model it would even be possible that module B 
runs on a remote machine but neither task would ever have to care about 
that.

A final note: Accesses to module B in that example are not performed 
using any special interfaces in the style of CORBA or COM which require 
wrapper code to be created. The accesses are formulated as 
straight-forward, non-wrapped code and no wrapper generator or IDL 
compiler is needed.

> At the moment, we have several different execution environments:
> * Kernelspace (device drivers, ...)

This one is quite special and has nothing in common with any of the 
other environments / spaces.

> * Daemonspace (all those servers ...)
> * GUIspace (KDE, Gnome, Windows, ...  "rich clients")

These two can be unified into a "classic" POSIX-compatible environment.

The question I'm facing is whether this should be isolated from the 
"enanced environement" I've been writing about here or whether the 
latter can be implemented purely as an extension to the former one, 
which would be great thing.

> * Webservicespace (everything running in a Browser "thin client"

I'd call that "interpreter space" because every interpreted programming 
languge can have a set of abstactions to the OS they are running on. The 
interpreter that is neccessary in this case typically runs in another 
environment - normally POSIX-compatible user space. So this has not be 
considered as special.

> All those execution environments have very different needs, and should be
> thought trough on their own, I guess.
> 

Right, though I don't agree with all the border lines you've been 
drawing above.

>> However, this has two rather obvious drawbacks: If only one module has a
>> bug the whole set would be forced down in a big crash. Second, every
>> module involved must be loaded in memory at startup. That's a waste of
>> memory. But these problems can be overcome.
> 
> 
> Have a look at the Perl module concept, and CPAN.
> It is nearly so automated, that it would automatically fetch and install 
> the
> needed modules from the Internet, as soon as you call the first.
> 

This proves that it is possible, especially that it can all be totally 
anonymous (it's something people really want to have).

>> The way to achieve this is quite easy - in theory. Enforcing controls
>> and checks on the environment will give the modules the ability to
>> gracefully handle crashes without pulling down every other module.
>> Furthermore the same mechanism can be used to perform a sort of lazy
>> linking: a module will only be loaded when it is referenced (call'ed
>> into) for the first time and will be unloaded as soon as the last active
>> reference has gone.
>> This is possible in an 80386 and upwards (and pretty likely even other
>> processor architectures as well): This processor has a pretty nifty
>> page-based access control. You can only set page table entries to
>> readable, writeable and/or non-existant, but that's sufficient, because
>> the rest is handled in software.
>> Each module get's its own private adress room which contains "windows"
>> into which the referenced modules are mapped. This mapping might not
>> even be real page-level mapping because of access limitations which need
>> to be enforced.
> 
> 
> Again, this sounds like shared libraries.
> 

I want it to be more than that. See the passage above where I've drawn 
the line between this "enhanced environment" (I'm still looking for a 
better name) and already available component models. I hope that the 
above becomes clearer now, too.

>> Each time a window is accessed the context of the running project
>> changes to the adress space of the referenced module, after appropriate
>> mapping has been done. Accesses through windows are trapped (except when
>>    non-pointer data is read and reading is allowed or other rather
>> trivial cases) and the instruction that has invoked the trap is
>> examined, and  then - if it is valid - emulated by the trap handler. In
>> case the instruction is invalid the module having caused this violation
>> will get the chance to handle the error and exit gracefully.
>> Of course access rights must be defined. This is done on a per-symbol
>> basis (symbol in this context means funcion/procedure entry point,
>> object, variable or data structure). Therefore each module must consist
>> of a binary and an access definition file. In this file there is an
>> entry for each symbol which grants or denies read, write and execute
>> rights for the owner of the module, his/her group and others (note that
>> making an extra file out of this has two benefits: first, there's no new
>> file format needed, and second this file could possibly edited by an
>> user or admin). In this file prototypes of each exported function and/or
>> variable must resides as well as definitions of exported data
>> structures, because during the adress space switch pointer addresses
>> might have to be tweaked so that they point into the right window
>> (imagnie that the process is tunneling back and forth between two
>> windows which map address spaces that have different real offsets). This
>> might not be neccessary when the windows are at the same addresses as
>> the modules that are referenced within their own address space.
>> This mechanism can be extened even further: Windows can map modules
>> running on remote machines. This only needs a small extension in the
>> form of a network protocol stack which is able to serialize and
>> reassemble such requests automatically. Furthermore this mechanism can
>> be exploited to map contents of the DOMfs into persistent objects and
>> thereby providing a decent interface.
> 
> 
>> All of this sounds like a lot of overhead. But I assume that it isn't. I
>> expect that code controlled this way is no more than two to three times
>> slower than a usual executable under otherwise identical conditions. And
>> that performance hit isn't recognizeable for desktop users given the
>> performance of current PC hardware.
>> 3. An interpreted language to automize the enhanced execution
>> environment: ObjectBasic
>> This is an optional thing and can be described as the "shell" of the
>> enhanced executino environment I just described. The language should be
>> easy to use, yet powerful enough to write small applications and
>> automization scripts for every day use.
> 
> 
>> Have you ever thought of
>> remote-controlling your word processor from a shell and writing a letter
>> this way?
> 
> 
> Yes. I have. But afterwards, I found no answer to the question:
> "And why should I?"
> 

This example should point out the ease of component reusage and 
automation that is possible. So both software integration and software 
development will hopefully become easier.

>> With this interpreter, it'll be possible. I promise. ;)
> 
> 
>> That's all! This is just a short description of the most important
>> points in my design proposal. Of course it is much longer:
>> integrated installer with online software update facility (the installer
> 
> might get
> 
>> things optimized inside the enhanced execution environment quite a lot),
> 
> 
> That one sounds good. CPAN could be a good example, how the technical side
> could work. What I am missing at CPAN is the missing Quality Control.
> 

The problem can only be overcome by manpower. And manpower is by default 
rare among volunteer efforts. But the direction is certainly right.

>> a new and hopefully superior graphical user interface, etc.
> 
> 
> Have a look at "Berlin", which is somehow connected to GGI, and have a look
> at Entity.
> 

Berlin's design is arguably one of the best, I agree. However I'd like 
to direct the focus more on new ways of user interaction and usability 
in general than on implementational details (the latter will possibly be 
determined by the former).

>> What else?
>> An office suite? An ERP system? A web server? I just don't know.
> 
> 
> ;-)
> Do you plan to run SAP on it?

No. Having spent a whole week trying to install CA's Manufacturing 
Knowledge (MK for short, which is a direct competitor to SAP R/3) at 
work and having spent three months trying to get the faintest idea of 
what it can do I think we could easily build our own ERP system on top 
of ChallengeOS :).

Gregor

PS: That MK installation still isn't complete. I've given it up by now :).

-- 
*****************************************************
* Gregor Mueckl                 Gre...@gm... *
*                                                   *
* The ChallengeOS project:                          *
* http://challengeos.sourceforge.net                *
*****************************************************
* Math problems?                                    *
* Call 1-800-[(10x)(13i)^2]-[sin(xy)/2.362x].       *
*****************************************************