In this paper we investigate approaches to select a set of sentences for speech samples to train acoustic models for Ukrainian both TTS and ASR systems. An algorithm that is not widely known is introduced and another one is applied. Several sub-word units are analysed: phoneme, phoneme-triphone and open syllable. Some experimental results are given and discussed.