Metamorph Multimodal Understanding And Generation Via Instruction Tuning — recommended 1 times