-After a couple of days reading the ARM docs for CM0, and messing with GCC's inline assembly syntax (it's *so* intuitive... ), I think I've got a workable context switch/thread-start.
-My stm32f0 board has the world's worst onboard debugger, so I need to order something better to test with.
-Also, let it be known that the code density on Cortex M0 is really not that great. The kernel compiles to about 48K with -0s...