Cross-Modal Data Based on Different Matching Patterns

1. Text-Image Data

royalty, history
art
biology
geography
history, royalty, art
literature, music, art
media, music
music
sport
warfare, history

2. Music-Image Data

music: R&B, hip_hop
music: tender, loving
music: electric, dance_pop
music: arousing positive
music: powerful sad