The desktop app uses mpv as player engine. It has the largest network buffer of all app types, so it can prefetch large portions of videos.
There is no need to go from mp4 to mkv, if you prefer mp4. You can remux affected files equally well to mp4, if you explicitly enable interleaving.
See Losslessly repair mp4 files without interleaving for instance.