# Conformer - [Conformer: Convolution-augmented Transformer for Speech Recognition](https://arxiv.org/abs/2005.08100) - [Transformer](Transformer.md) models are good at capturing content-based global interactions, while CNNs exploit local [Features](Features.md) effectively - integrating components from both CNNs and Transformers for end-to-end [speech recognition](speech recognition.md) to model both local and global dependencies of an audio sequence in a parameter-efficient way - importance of each component, and demonstrated that the inclusion of convolution modules is critical to the performance of the Conformer model - propose the convolution-augmented [Transformer](Transformer.md) for [speech recognition](speech recognition.md), named Conformer - [LibriSpeech](LibriSpeech.md)