掲示板 Forums - Pre-fetch audio files
Top > renshuu.org > Feature Requests/Improvements Getting the posts
Top > renshuu.org > Feature Requests/Improvements
Audio is regularly delayed, often multiple seconds, before playing, entirely due to network conditions. There are 3 things that could be done to nearly eliminate the issue.
The same could be done for images, but is less critical.
1. Fetch every audio file on the previous card during a quiz. There are multiple ways to preload a file without playing it.
2. Clone all audio files to a more efficient, compressed audio format.
3. Use a free or cheap Cloudflare CDN plan to mirror files on edge nodes, making them far faster to load, increase caching, and reducing outbound network costs from your server.
1. Something like this is planned for later this year
2. The sentence audio needs to be redone at a lower bitrate (it's on my to-do list), but the format is already pretty small, and I'm not, at least at the moment, looking to handle multiple formats based on what the client can handle.
3. Outbound costs are 0 beyond what I pay for base server costs, so that's not an issue. I'd prefer to work on #1/2 first, then see how it works. Additionally, working on regional "paths" to the renshuu server which have shown higher speeds and less variance compared to the usual path that most users' clients take to the server.
Curiosity got the best of me, so I tried a separate batch of the word audio using webm - but the file size actually increased due to the overhead in the files, and how small they were to begin with. Trying out .ogg (for compatible clients), much better results
Making good progress with the sentence audio getting smaller. Almost 66% of the size was cut out, as the voice artists send me the files in high quality formats, and I never got around to resizing them.
I will have settings available tomorrow to test both the new sentence audio, and the new word audio (word is using .ogg). If both work, I can switch the sentences over to the .ogg as well for even more of a speedup.
Great to hear! That is a significant reduction!
Curious if you are using Ogg Vorbis or Ogg Opus. Opus seems like a more modern standard for web voice audio.
Do you have network file compression added as well? May not see much more of a boost, but could benefit anyone on a slower network.
For any devops, CDN edge nodes are usually one of the first things setup for a new site with any media content. It is incredibly simple and can sometimes cut ~75% of the egress time, thanks to reduced latency. Self-hosted/rented servers are rarely egress optimized, and don't get priority on networks.
Another simple improvement could be adding long expiration cache headers (if not set already). The files basically never change, and many files will be repeatively requested day over day. The now-decreased file sizes will allow far more to be stored before cache eviction.
Another improvement that could improve speed perception is to auto-trim dead space at the start and end of the audio files. I regularly encounter ones that take a sec to start after each time clicking the button.
That would need some trial and error to make sure to tool correctly trims. Either way, make sure the originals are solidly backed up :)
Thanks again for taking a look. Glad to pique your curiosity
It's honestly something I do need to look into. I feel like the images world be harder to do, as my current setup calls them with a height parameter, and it builds a new sized filewhen necessary, and webp if client allowed.I imagine all of that works have to be done prior to pushing a file to the cdn? Audio world be much easier, I imagine.
the smaller files are now available behind flags in the experimental settings :)
Oh, and to answer your earlier question: I tried opus first, but sizes were too big. Looks like the overhead is just a few thousand bytes, but when you are doing 1-3 second files (words, some sentences), the overhead makes it less efficient than mp3 or vorbis.
Turned on the experimental settings!
If there are consistent blocks of the files and you want to get really efficient, you could cut those from the blobs before sending and append it to all audio files on the client.
Plenty of other optimizations available before that though