Pilot test collection for the HOMED Project. 11 paired files with publicly available audio files with conversations in the medical domain. The material was collected by coauthor Toine Pieters. The recordings contain a mixture of interviews, patient doctor consultations. Consent is given by the interviewees for reuse for educational and research purposes only.
At Radboud we added the manual transcriptions to the recordings. This test set can be used for testing purposes in of language and speech technology in the medical domain.
Audio file and corresponding transcription file have the same name. They only differ in extension (wav for the audio; txt for the transcription). Speaker indications and time stamps are part of the transcriptions.