Sometimes debugging with ADT can be really fun!
We were playing with the upcoming demo app in an API 17 hardware-accelerated emulator, and decided to do an HPROF run just to see where things were at. After all, it just takes a couple clicks to get one!
The results were not surprising: majority of the time is spent in the rendering cycle. It was not too bad, but we found some places to eliminate per-frame re-calculations. The next HRPOF run should be better.
Emulator performance is tolerable even with 30 FPS and 100ms time base tick. Also not sure if it's actually performing 30 FPS in the emulator, because excess DRAW_FRAME messages are discarded.
GCs are a fact-of-life in managed code, so we will be doing some memory analysis soon, but be aware we are always looking at that. We are definitely relying on the concurrent GC to take minimal time, and on modern hardware <10ms is typical.