I’m just wondering how much hard drive (SSD) space I need to be considering when turning on the “Generate voice activity data” option?
I know the Credits and Intro detections only use markers, so there isn’t much there, but it isn’t clear to me how much data is needing to be stored to allow this dynamic auto-sync.
Looking forward to trying out this feature!
EDIT: Is it dependent on the number of subtitles as well as the play length of the media?
I don’t know for sure but I’m guessing that it converts the audio track into a series of start and end times for when voice is detected. So it would be around as big as an SRT file. Perhaps the certainty percentage is also stored since not all audio frequencies existing in the range of human voices will always be 100% certain that they are human voices.
The subtitle files are compared to the stored audio data, so it is not analyzed and saved. It is only analyzed then the resulting offset is saved.