Publications

Publications in reverse chronological order, grouped by year. For more details, please visit my Google Scholar profile.

2025

  1. Data-Driven White Noise Gain Constrained Robust Superdirective Beamformer for Speech Enhancement
    Hanchen Pei, Gongping Huang, Jilu Jin, and 4 more authors
    In ICASSP 2025 - 2025 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2025
  2. Rethinking Mamba in speech processing by self-supervised models
    Xiangyu Zhang, Jianbo Ma, Mostafa Shahin, and 2 more authors
    In ICASSP 2025 - 2025 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2025
  3. Diff-SAGe: End-to-End Spatial Audio Generation Using Diffusion Models
    Saksham Singh Kushwaha, Jianbo Ma, Mark R. P. Thomas, and 2 more authors
    In ICASSP 2025 - 2025 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2025

2024

  1. Gotta hear them all: Sound source aware vision to audio generation
    Wei Guo, Heng Wang, Jianbo Ma, and 1 more author
    arXiv preprint arXiv:2411.15447, 2024
  2. A unified multichannel far-field speech recognition system: combining neural beamforming with attention based end-to-end model
    Dongdi Zhao, Jianbo Ma, Lu Lu, and 6 more authors
    arXiv preprint arXiv:2401.02673, 2024
  3. V2A-Mapper: A lightweight solution for vision-to-audio generation by connecting foundation models
    Heng Wang, Jianbo Ma, Santiago Pascual, and 2 more authors
    In Proceedings of the AAAI Conference on Artificial Intelligence, 2024
  4. A low latency attention module for streaming self-supervised speech representation learning (second version of ‘low latency attention’)
    Jianbo Ma, Siqi Pan, Deepak Chandran, and 2 more authors
    arXiv preprint arXiv:2302.13451, 2024

2023

  1. Low latency transformers for speech processing
    Jianbo Ma, Siqi Pan, Deepak Chandran, and 2 more authors
    arXiv preprint arXiv:2302.13451, 2023

2022

  1. Hidden Markov Models and Connectionist Temporal Classification
    Jianbo Ma
    2022

2021

  1. ASR technical report
    Jianbo Ma
    2021