Group software and corpora
The AGH Signal Processing Group develops and owns computer applications and databases for speech technology research:
- OrtFon - a program for phonetic transcription of Polish,
- HODB database,
- HTK model for Polish - ready for use,
- 4 Polish text corpora downloaded from the Internet (books, journals, professional speech transcriptions, Internet articles) - around 10 GB of text,
- additional, hand-made word annotations for the LUNA corpus (MLF files),
- 3 corpora with Polish names (PRP, LW, ANWIL) - around 130,000 in total.