First off, I'd trim the 12 seconds of silence at the start of the video. Few better ways to drive down your views than to make people sit through dead air.
Can you improve the lip sync at all? It's pretty off right now.
So the vocals are kinda sampled?
What about the music? Piano, violin, percussion?