List of Publications

Journal

2022

H. Guimarães, A. Pimentel, A. Avila, M. Rezagholizadeh, T. Falk,

Improving the Robustness of DistilHuBERT to Unseen Noisy Conditions via Data Augmentation, Curriculum Learning, and Multi-Task Enhancement, NeurIPS ENLSP Workshop, 2022.

A. Pimentel, H. Guimarães, A. Avila, M. Rezagholizadeh, T. Falk,

How Robust is Robust wav2vec 2.0 for Edge Applications?: An Exploration into the Effects of Quantization and Model Pruning on “In-the-Wild” Speech Recognition, Edge Intelligence Workshop, 2022. 

H. Guimarães, A. Pimentel, A. Avila, M. Rezagholizadeh, T. Falk,

An Exploration into the Performance of Unsupervised Cross-Task Speech Representations for ”In the Wild” Edge Applications, Edge Intelligence Workshop, 2022.

2021

A. Avila, D. O’Shaughnessy, T. Falk,

Automatic Speaker Verification from Affective Speech Using Gaussian Mixture Model Based Estimation of Neutral Speech Characteristics, J. Speech Communicationn, vol. 132, p. 21-31, 2021.

A. Avila, J. Alam, F. Prado, D. O’Shaughnessy, T. Falk,

On the Use of Blind Channel Response Estimation and a Residual Neural Network to Detect Physical Access Attacks to Speaker Verification Systems, J. Computer Speech & Language, vol. 66, 2021.

2020 -2019

A. Avila, D. O’Shaughnessy, T. Falk,

Non-intrusive Speech Quality Prediction Based on the Blind Estimation of Clean Speech and the i-vector Framework, J. Quality and User Experience, vol. 5, no 1, p. 1-15, 2020.

A. Avila, J. Alam, D. O’Shaughnessy, T. Falk,

On the Use of the I-vector Speech Representation for Instrumental Quality Measurement, J. Quality and User Experience, 2020, vol. 5, no 1, pp. 1-14, DOI: https://doi.org/10.1007/s41233-020-00036-z.

B. Sadou, A. Lahoulou, T. Bouden, A. Avila, T. Falk, Z. Akhtar,

Free-Reference Image Quality Assessment Framework using Metrics Fusion and Dimensionality Reduction, Signal & Image Processing, 2019, Vol. 10, No. 5, pp. 1-14, DOI: 10.5121/sipij.2019.10501.

2018

A. Avila, Z. Akhtar, J. Santos, D. O’Shaughnessy, T. Falk,

Feature Pooling for Spontaneous Speech-Based Emotion Recognition in-the-wild, IEEE Transaction on Affective Computing, 2018, pp. 1-12, DOI: 10.1109/TAFFC.2018.2858255.

Conferences

2023

Wahab, Omar A and Avila, Anderson R.

“A Max-Min Security Game for Coordinated Backdoor Attacks on Federated Learning.” First International Workshop on Machine Learning for securing IoT systems using BigData, submitted (2023).

Raphaël Khoury, Anderson Avila, Jacob Brunelle, Baba Mamadou Camara,

“How Secure is Code Generated by ChatGPT?”, IEEE Systems, Man, and Cybernetics (SMC), Maui, HA, USA, Oct. 2023.

H. Guimarães, Y. Zhu, O. Mengara, A. Avila, and T. Falk,

Assessing the Vulnerability of Self-Supervised Speech Representations for Keyword Spotting under White-Box Adversarial Attacks, IEEE SMC 2023.

H. Guimarães, A. Pimentel, A. Avila, M. Rezagholizadeh, B. Chen, and T. Falk,

RobustDistiller: Compressing Universal Speech Representations for Enhanced Environment Robustness, ICASSP 2023.

2022

H. Guimarães, A. Pimentel, A. Avila, M. Rezagholizadeh, T. Falk,

Improving the Robustness of DistilHuBERT to Unseen Noisy Conditions via Data Augmentation, Curriculum Learning, and Multi-Task Enhancement, NeurIPS ENLSP Workshop, 2022.

A. Pimentel, H. Guimarães, A. Avila, M. Rezagholizadeh, T. Falk,

How Robust is Robust wav2vec 2.0 for Edge Applications?: An Exploration into the Effects of Quantization and Model Pruning on “In-the-Wild” Speech Recognition, Edge Intelligence Workshop, 2022. 

H. Guimarães, A. Pimentel, A. Avila, M. Rezagholizadeh, T. Falk,

An Exploration into the Performance of Unsupervised Cross-Task Speech Representations for ”In the Wild” Edge Applications, Edge Intelligence Workshop, 2022.

A.Avila, etal.

Low-bit Shift Network for End-to-End Spoken Language Understanding, Interspeech 2022, DOI: 10.21437/Interspeech.2022-760.

2021

N.Potdar, A.Avila,C.XING, etal.

A Streaming End-to-End Framework For Spoken Language Understanding, IJCAI 2021, pp. 3906-3914, DOI: 10.24963/ijcai.2021/538.

Y.Cao, N.Potdar, and A.Avila.

Sequential End-to-End Intent and Slot Label Classification and Localization, Interspeech 2021.

2019

A. Avila, H. Gamper, C. Reddy, R. Cutler, I. Tashev, J. Gehrke,

Non-intrusive speech quality assessment using neural networks, ICASSP 2019 – 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 631-635, DOI: 10.1109/ICASSP.2019.8683175.

A. Avila, J. Alam, D. O’Shaughnessy, T. Falk,

Blind Channel Response Estimation for Replay Attack Detection, Interspeech 2019, pp. 2893-2897, DOI: 10.21437/Interspeech.2019-2956.

A. Avila, S. Kshirsagar, A. Tiwari, D. Lafond, D. O’Shaughnessy, and T. Falk,

Speech-Based Stress and Emotion Classification Based on Modulation Spectral Features and Convolutional Neural Networks, 27th European Signal Processing Conference (EUSIPCO) 2019, pp. 1-5, DOI: 10.23919/EUSIPCO.2019.8903014.

B. Sadou, A. Lahoulou, T. Bouden, A. Avila, T. Falk, Z. Akhtar,

Blind Image Quality Assessment using SVD based Dominant Eigenvectors for Feature Selection, SIPRO 2019.

A. Avila, J. Alam, D. O’Shaughnessy, T. Falk,

Intrusive Quality Measurement of Noisy and Enhanced Speech based on i-Vector Similarity, QoMEX 2019, pp. 1-5, DOI: 10.1109/QoMEX.2019.8743285. Nominated for Best Paper Award

2018-2016

A. Avila, J. Alam, D. O’Shaughnessy, T. Falk,

Investigating Speech Enhancement and Perceptual Quality for Speech Emotion Recognition, Interspeech 2018, pp. 3663-3667, DOI: 10.21437/Interspeech.2018-2350..

A. Avila, J. Monteiro, D. O’Shaughnessy, T. Falk,

Speech Emotion Recognition on Mobile Devices Using a New Modulation Spectrum Pooling and Deep Neural Networks, ISSPIT 2017, 2017 IEEE International Symposium on Signal Processing and Information Technology (ISSPIT), pp. 360-365, DOI: 10.1109/ISSPIT.2017.8388669.

A. Avila, B. Cauchi, S. Goetze, S. Doclo, T. Falk,

Performance Comparison of Intrusive and Non-instrusive Instrumental Quality Measures for Enhanced Speech, IWAENC 2016, 2016 IEEE International Workshop on Acoustic Signal Enhancement (IWAENC), pp. 1-5, DOI: 10.1109/IWAENC.2016.7602907.

2014

A. Avila, M. Santos, F. Fraga, and T. Falk,

The Effect of Speech Rate on Automatic Speaker Verification: a Comparative Analysis of GMM-UBM and I-vector Based Methods, 12th Audio Engineering Conference (AES-Brazil), May 2014.

A. Avila, M. Paja, F. Fraga, D. O’Shaughnessy, and T. Falk,

Improving the Performance of Far-Field Speaker Verification Using Multi-Condition Training: The Case of GMM-UBM and i-vector Systems, Interspeech’2014.

A. Avila, M. Santos, F. Fraga, and T. Falk,

Investigating the use of Modulation Spectral Features within an Ivector Framework for Far-Field Automatic Speaker Verification, International. Telecommunications Symposium, 2014.

2013-2011

A. Avila, F. Prado, G. Kobayashi, E. Rocha,

Performance Comparison of Overdetermined Multilateration Algorithms for Estimating Aircraft Position. In: Workshop on Distance Geometry and Applications (DGA), 2013, Manaus.

A. Avila, M. Paja, F. Fraga,

Proposta de um Sistema de Diálogo Automático Baseado em Algoritmos de Aprendizado Por Reforço. In: Proceedings of the 10th AES Brazil Conference. Rio de Janeiro: Audio Engineering Society, 2012. v. 1. p. 75-78.

A. Avila, M. Paja, F. Fraga,

Integracão de Sistemas de Reconhecimento, Tradução e Síntese Automática da Fala para Facilitar a Comunicação de Turistas. In: The 14th LAC AES Conference. Montevideo: Audio Engineering Society, 2011. v. 1, p. 1-4.