Ain't going to happen, because a microcontroller does not have the bandwidth to handle a video stream unless we're talking some really low resolution. There is some capacity to handle the audio though.
A microcontroller can send configuration commands to a decoder chip, set picture-in-picture modes, whatever the chip is capable of. But it can't handle the video stream.