Abstract: Connectionist temporal classification (CTC) is one of the predominant schemes for end-to-end speech recognition because of its simplicity, efficiency and reliability. However, as a sequence ...