Jianbo Ma

This is a home page of Jianbo Ma, who is a researcher in the area of machine learning/deep learning. The world is changing fast and we ought to share and exchange ideas more frequently. This motivates me to create this page, in order to share ideas with you who may also have the same interests.

news

No news so far...

latest projects

latest posts

Dec 12, 2023 Quick Paper Post

selected publications

  1. here_all.png
    Gotta hear them all: Sound source aware vision to audio generation
    Wei Guo, Heng Wang, Jianbo Ma, and 1 more author
    arXiv preprint arXiv:2411.15447, 2024
  2. diff-sage.png
    Diff-SAGe: End-to-End Spatial Audio Generation Using Diffusion Models
    Saksham Singh Kushwaha, Jianbo Ma, Mark R. P. Thomas, and 2 more authors
    In ICASSP 2025 - 2025 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2025
  3. unified_multichannel_ASR.png
    A unified multichannel far-field speech recognition system: combining neural beamforming with attention based end-to-end model
    Dongdi Zhao, Jianbo Ma, Lu Lu, and 6 more authors
    arXiv preprint arXiv:2401.02673, 2024
  4. wang2024v2a_overview.png
    V2a-mapper: A lightweight solution for vision-to-audio generation by connecting foundation models
    Heng Wang, Jianbo Ma, Santiago Pascual, and 2 more authors
    In Proceedings of the AAAI Conference on Artificial Intelligence, 2024
  5. low_latency_attention.png
    A low latency attention module for streaming self-supervised speech representation learning (second version of ’low latency attention’)
    Jianbo Ma, Siqi Pan, Deepak Chandran, and 2 more authors
    arXiv preprint arXiv:2302.13451, 2024
  6. low_latency_attention.png
    Low latency transformers for speech processing
    Jianbo Ma, Siqi Pan, Deepak Chandran, and 2 more authors
    arXiv preprint arXiv:2302.13451, 2023
  7. ctc_backprop.png
    Hidden Markov Models and Connectionist Temporal Classification
    Jianbo Ma
    In , 2022