Generate multilingual speech from text
Detect anime faces and landmarks in an image
Towards Unified Music Emotion Recognition across Dimensional