Looking for a way to mass import audio files for Anki cards

October 12, 2021 at 04:19 PM

This post talks about the previous studying routine that I had set up.

It mostly consisted of using the Lua script to automtically generate Anki cards after going through the scripts of a Chinese drama with the help of the Chinese Text Analyzer, a wonderful program made by Imron.

I also played with Auphonics to utilize its Automatic Speech Recognition, thereby generating not only an enhanced version of the audio but also subtitle files.

Recently, I had an eureka moment and got myself a script written in Python.

This script receives a mp3 file and subtitle file as input and results in two output files.

Two output files include a bunch of smaller/shorter audio files split according to the time stamps indicated in the subtitle file for each sentence.

The other output file is an excel spreadsheet that lists the audio file numbers and corresponding texts.

In an effort to have real-human voice recording attached to the Anki cards, I inserted the individual audio files created by the above Python script.

And now, I am wondering, is there a way to add even more automation to this process?

I'd like to automate the part that I am manually adding audio files.

I already have the Python script written in a way that each sentence is numbered and this data is recorded on the spreadsheet.

If anyone can help, please do share their insight and I will appreciate it a lot.

06.csv anki-export.lua subs.py

October 12, 2021 at 06:12 PM

If I understand correctly, I think you've already done all the hard parts. Anki lets you update existing notes by importing text files.

Create a spreadsheet where column A contains the texts that match existing cards and column B contains references to the audio files using Anki's syntax (e.g. "[sound:ni3hao3.mp3]").
Save this spreadsheet as a tab-separated text file.
Create a field in your notes for the audio file reference.
Copy all the MP3 files into Anki's media directory.
Import the text file into Anki, making sure to select the option for updating existing cards.

October 12, 2021 at 06:22 PM

I've done this the way @大块头 suggests too.

I think there is one part missing though. The csv import will only import the cards, but not the media files so you need to manually place the mp3 files in the Anki media folder. When Anki sees that [sound:ni3hao3.mp3] syntax it will just look for that file in the media folder and use it if it finds it.

You can for example create one card manually, then export it into csv, then use that as the template to create the rest of the cards using python, vim macro or whatever, and finally import the cards back to the collection.

An easy way to find the media folder is to just to copy the file name of some file you know works in Anki from the csv and then search for it.

October 12, 2021 at 06:27 PM

On 10/12/2021 at 2:22 PM, alantin said:

I think there is one part missing though. The csv import will only import the cards, but not the media files so you need to manually place the mp3 files in the Anki media folder. When Anki sees that [sound:ni3hao3.mp3] syntax it will just look for that file in the media folder and use it if it finds it.

Thanks, I revised my earlier post to include this step.

October 12, 2021 at 07:42 PM

is auphonics like the auto-generated subtitles on youtube? in that in only work wells if there isn't a lot of background noise/music, sound effects etc

October 12, 2021 at 11:35 PM

On 10/12/2021 at 3:42 PM, brownnoser said:

is auphonics like the auto-generated subtitles on youtube? in that in only work wells if there isn't a lot of background noise/music, sound effects etc

Please look into Auphonics and try for yourself. I don't believe that it's like auto-generated subtitles on YouTube.

I even provided the link in the original post. Results may vary but my experience has been that Auphonics does a pretty good job of generating transcripts/subtitles for me. Also, Auphonics mainly intends to get rid of the background noise and enhance the audio quality. So you get both, which is what I also mentioned in the original post.

Sign In

Looking for a way to mass import audio files for Anki cards

Recommended Posts

pon00050

大块头

alantin

大块头

brownnoser

pon00050

Join the conversation