"Samples per frame = (sample rate)/FPS" (source: socialsounddesign.com/questions/5022/how-many-samples-are-in-a-frame)

other google results and also my intuition support this.

i think it should be like that, because a "sample" is one "snapshot" of *all* voltages coming in from your microphone port(s). a sample may then be either a compound data type (like struct stereo_sample_t { float left; float right; }) or just a float or something similar.
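
to make the "snapshot" idea concrete, here is a minimal sketch in C. it assumes float values and exactly two channels; the helper name snapshots_to_seconds is made up for illustration and is not taken from MusE or any other program.

```c
#include <stddef.h>

/* One "snapshot" of both channels at a single instant
 * (assumption: stereo input, float values). */
typedef struct {
    float left;
    float right;
} stereo_sample_t;

/* Hypothetical helper: how many seconds of audio are covered by
 * `count` snapshots taken at `sample_rate_hz` snapshots per second. */
static double snapshots_to_seconds(size_t count, unsigned sample_rate_hz)
{
    return (double)count / (double)sample_rate_hz;
}
```

under this view, a buffer of 44100 stereo_sample_t values at a 44100 Hz sampling rate covers one second, no matter how many channels each snapshot carries.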

can you explain your point or give sources?


i think that MusE uses the wrong nomenclature: it often says
"frames" when it means "samples".

afaik, "frames" are 24ths or 30ths of a second (depending on the
video standard; i.e., one frame is one single picture of a movie)

Correct. But you are referring to the "timecode" (picture) frame rate.

Timecode frames contain /many/ audio frames.

Audio frames, on the other hand, contain one or more samples (one per channel).
I believe it is valid to say something like:
"each audio frame contains two samples"
which would be stereo sample frames. And:
"each SMPTE frame holds
(AudioSamplerate / SMPTE_FrameRate)
stereo sample audio frames."
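
As a rough sketch of that arithmetic in C (the function names are invented for illustration, and it assumes a constant frame rate given as a plain number):

```c
/* Audio frames (one sample per channel) that fit into one
 * timecode frame: AudioSamplerate / SMPTE_FrameRate.
 * The result need not be a whole number, e.g. 44100 / 24. */
static double audio_frames_per_timecode_frame(double sample_rate_hz,
                                              double timecode_fps)
{
    return sample_rate_hz / timecode_fps;
}

/* Individual samples in one timecode frame: every audio frame
 * holds `channels` samples (2 for stereo). */
static double samples_per_timecode_frame(double sample_rate_hz,
                                         double timecode_fps,
                                         unsigned channels)
{
    return (double)channels *
           audio_frames_per_timecode_frame(sample_rate_hz, timecode_fps);
}
```

For example, at 48000 Hz and 30 timecode frames per second this gives 1600 audio frames per timecode frame, which is 3200 individual samples for stereo.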

I know terms get mixed up very easily, I'm guilty too.

But I think that's how the explanation goes.


"samples" are 44100ths (or whatever the sampling rate says) of
a second.

did i get that wrong? or is it just wrong overall in muse?


