init_dmo_media_type_video needs memory for the header plus the extra bytes, and with MEDIASUBTYPE_RGB8 the extra bytes alone are already 1024. This leads to stack corruption.
Note that not all of these changes are strictly necessary to prevent a crash, but I think it's safer to make them in case they become relevant in the future.
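
For reference, the size arithmetic behind the 1024 figure can be sketched roughly as follows. This is only an illustration, not the code from the patch: the helper names and the constants are hypothetical stand-ins, and it assumes the extra bytes for MEDIASUBTYPE_RGB8 are a 256-entry palette of 4-byte entries. The header size is passed in as a parameter because it depends on the real structure definitions.

```c
#include <stddef.h>

/* Hypothetical stand-ins for illustration; not the real Windows definitions. */
enum
{
    PALETTE_ENTRIES = 256,   /* assumed: one palette entry per 8-bit index */
    PALETTE_ENTRY_SIZE = 4,  /* assumed: size of one RGBQUAD-like entry */
};

/* Extra bytes that follow the header for an 8-bit palettized format. */
static size_t rgb8_extra_bytes(void)
{
    return (size_t)PALETTE_ENTRIES * PALETTE_ENTRY_SIZE;
}

/* Total memory needed: header plus extra bytes. */
static size_t required_size(size_t header_size, size_t extra_bytes)
{
    return header_size + extra_bytes;
}

/* Returns nonzero if a buffer of buffer_size bytes can hold the header
 * and the extra bytes without being overrun. */
static int buffer_is_large_enough(size_t buffer_size, size_t header_size,
                                  size_t extra_bytes)
{
    return buffer_size >= required_size(header_size, extra_bytes);
}
```

With these assumptions, a buffer of exactly 1024 bytes is too small for any non-zero header size, since the extra bytes alone already fill it.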