In medicine, questions are often posed as pictures (for example, "identify the marked structure"). Sometimes you also have to identify sounds (e.g. heartbeats).
It would be very useful to be able to include this kind of multimedia data in questions (and in answers too).
Text also needs to accompany the pictures and sounds, since the questions themselves are asked as text.
I do not expect this to make it any easier, as I would assume that including all the QuickTime data types is easiest, but to start with, JPG and MP3 would suffice.
Example screen
Logged In: YES
user_id=1562722
Originator: YES
File Added: thorax.jpg