Remove silence/non-speech sections from a speech sample
Calculate distances between speech recordings
Compare audio clips and get alignment details
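
The three operations listed above can be illustrated with a small, self-contained sketch. It rests on assumptions, since the summaries do not say which methods are used: silence removal is approximated here with a frame-level RMS energy threshold, and the distance and alignment are computed with plain dynamic time warping (DTW) over one-dimensional sequences. The names `trim_silence` and `dtw` and the parameters `frame_len` and `threshold` are hypothetical and chosen only for illustration.

```python
import math


def trim_silence(samples, frame_len=4, threshold=0.1):
    """Drop frames whose RMS energy falls below `threshold` (assumed method)."""
    kept = []
    for start in range(0, len(samples), frame_len):
        frame = samples[start:start + frame_len]
        rms = math.sqrt(sum(x * x for x in frame) / len(frame))
        if rms >= threshold:
            kept.extend(frame)
    return kept


def dtw(a, b):
    """Return (distance, path) between two 1-D sequences via dynamic time warping.

    `path` is a list of (index_in_a, index_in_b) pairs describing the alignment.
    """
    n, m = len(a), len(b)
    inf = float("inf")
    # cost[i][j] = minimal accumulated cost of aligning a[:i] with b[:j]
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = abs(a[i - 1] - b[j - 1])
            cost[i][j] = d + min(cost[i - 1][j], cost[i][j - 1], cost[i - 1][j - 1])

    # Backtrack from the end to recover the alignment path.
    path = []
    i, j = n, m
    while i > 0 and j > 0:
        path.append((i - 1, j - 1))
        candidates = [
            (cost[i - 1][j - 1], i - 1, j - 1),  # diagonal step
            (cost[i - 1][j], i - 1, j),          # advance in a only
            (cost[i][j - 1], i, j - 1),          # advance in b only
        ]
        _, i, j = min(candidates)
    path.reverse()
    return cost[n][m], path


if __name__ == "__main__":
    clip = [0, 0, 0, 0, 0.5, -0.5, 0.5, -0.5, 0, 0, 0, 0]
    print(trim_silence(clip))
    distance, alignment = dtw([1, 2, 3], [1, 2, 2, 3])
    print(distance, alignment)
```

In practice the comparison would run on per-frame feature vectors (for example, spectral features) rather than raw scalar values, with the absolute difference replaced by a vector distance; the scalar version is kept here only to keep the sketch short.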