Issue Date | Title | Author(s) | Relation | scopus | WOS | Fulltext/Archive link |
---|---|---|---|---|---|---|
2024 | SpeechCLIP: Self-supervised multi-task representation learning for speech via CLIP and speech-image data | Hsuan-Fu Wang; Yi-Jen Shih; Heng-Jui Chang; Layne Berry; Puyuan Peng; Hung-yi Lee; Hsin-Min Wang; David Harwath | | | | |