×
Pytorch Cross-modal Transformer between Audio and Text. pytorch cross-modal Transformer using two features: MFCC from audio signal (1-channel); BERT last ...
Missing: carat url? opi= 89978449 maps. google. maps? q= carat+ url% 3Fq% 3Dhttps:// md&usg= AOvVaw04qUM_GA_MmDZJYx_ozntL&um= 1&ie= UTF- 8&ved= 200713&ictx=
cross-modal model between audio(MFCC) and text(KoBERT) - audiotext-transformer/datasets.py at master · Donghwa-KIM/audiotext-transformer.
Missing: carat url? opi= 89978449 maps. google. maps? q= carat+ url% 3Fq% 3Dhttps:// md&usg= AOvVaw04qUM_GA_MmDZJYx_ozntL&um= 1&ie= UTF- 8&ved= 200713&ictx=
cross-modal model between audio(MFCC) and text(KoBERT) - audiotext-transformer/utils.py at master · Donghwa-KIM/audiotext-transformer.
Missing: carat audio/ url? opi= 89978449 maps. google. maps? q= carat+ url% 3Fq% 3Dhttps:// md&usg= AOvVaw04qUM_GA_MmDZJYx_ozntL&um= 1&ie= UTF- 8&ved= 200713&ictx=
Google Drive. To prepare the dataset: Put downloaded zip files under data directory, and run data_unzip.sh to extract the zip ...
Missing: carat opi= 89978449 maps. maps? q= carat+ 3Fq% 3Dhttps:// Donghwa- KIM/ audiotext- blob/ master/ md&usg= AOvVaw04qUM_GA_MmDZJYx_ozntL&um= 1&ie= UTF- 8&ved= 200713&ictx=
STEP1: Convert input audio to text using Google ASR API; STEP2: Extract MFCC feature from input audio; STEP3: Conduct MLM on KoBERT through colloquial ...
Missing: carat url? opi= 89978449 maps. maps? q= carat+ url% 3Fq% 3Dhttps:// blob/ md&usg= AOvVaw04qUM_GA_MmDZJYx_ozntL&um= 1&ie= UTF- 8&ved= 200713&ictx=