About

3rd year Ph.D. Student

Department of Electrical & Computer Engineering

Worcester Polytechnic Institute, MA, USA

Welcome to my website! I am pursuing my Ph.D. under the supervision of Dr. Bashima Islam at the at the Department of Electrical and Computer Engineering, WPI . I am an on-device deep learning researcher working on the integration and analysis of multimodal data to enhance real-time AI applications.

My research interests lie at the intersection of deep learning, audio and speech signal processing , cyber-physical systems, and the Internet of Things (IoT). I am particularly interested in developing DNN-based algorithms that can efficiently process multimodal (audio, video, sensor) data on resource-constrained IoT and embedded system devices. My primary focus is on designing robust deep neural architectures that can adapt to dynamic real-world scenarios, such as erroneous or missing sensor input, without compromising performance.

I worked as a Research Scientist Intern at Meta Reality Labs with the Audio Research Group in the summer of 2025. I am currently working as a Part-Time Student Researcher with the same group. Previously, I worked as a Software Engineer, AI & IoT at Advanced Chemical Industries Limited (ACI) .

Priot to joining WPI, I completed my Bachelor's in Electrical and Electronic Engineering from Bangladesh University of Engineering and Technology (BUET).

Please take the time to visit my website to learn more about myself, my research, and my professional experiences. Whether you are a fellow engineer, researcher, potential collaborator, or just interested in talking to me about something, please feel free to contact me via email.

Download CV


News

08-2024

I will be joining Meta Reality Labs as Part-Time Student Researcher.

06-2024

Our paper got accepted at INTERSPEECH, 2024.

05-2024

I will be joining Meta Reality Labs as Researche Scientist Intern.

05-2024

Our paper FreeML got accepted at EWSN, 2024.

11-2023

Passed Ph.D. diagonstic exam.

05-2023

Started Working as Graduate Research Assistant at BASH Lab, ECE, WPI.

08-2022

Started Working as Graduate Teaching Assistant at Dept. of ECE, WPI.

08-2022

I will be starting my Ph.D. at BASH Lab, ECE, WPI.

02-2021

Received my Bachelor of Science in Electrical and Electronic Engineering from Bangladesh University of Engineering and Technology (BUET).

02-2021

Defended my undergrad thesis titled "A Deep Learning Based Energy Efficient Downlink Power Control Mechanism for Cellular Networks".

01-2021

I will be starting as Software Engineer, AI & IoT at ACI, Limited.


Education

Ph.D. in Electrical and Computer Engineering
Worcester Polytechnic Institute, Worcester, MA, USA
Expected August 2026
BSc. in Electrical and Electronic Engineering
Bangladesh University of Engineering and Technology, Dhaka, Bangladesh
February 2021

Thesis: A Deep Learning Based Energy Efficient Downlink Power Control Mechanism for Cellular Networks


Publications

Missingness-resilient Video-enhanced Multimodal Disfluency Detection

Payal Mohapatra, Shamika Likhite, Subrata Biswas, Bashima Islam, Qi Zhu

INTERSPEECH'24

Most existing speech disfluency detection techniques only rely upon acoustic data. In this work, we present a prac- tical multimodal disfluency detection approach that leverages available video data together with audio. We curate an audio- visual dataset and propose a novel fusion technique with unified weight-sharing modality-agnostic encoders to learn the tempo- ral and semantic context. Our resilient design accommodates real-world scenarios where the video modality may sometimes be missing during inference. We also present alternative fusion strategies when both modalities are assured to be complete. In experiments across five disfluency-detection tasks, our unified multimodal approach significantly outperforms Audio-only uni- modal methods, yielding an average absolute improvement of 10% (i.e., 10 percentage point increase) when both video and audio modalities are always available, and 7% even when video modality is missing in half of the samples.


Memory-efficient Energy-adaptive Inference of Pre-Trained Models on Batteryless Embedded Systems

Subrata Biswas, Pietro Farina, Eren Yildiz, Khakim Akhunov, Saad Ahmed, Bashima Islam, Kasim Sinan Yildirim

International Conference On Embedded Wireless Systems and Networks (EWSN'24)

Batteryless systems frequently face power failures, requiring ex- tra runtime buffers to maintain inference progress and leaving only a memory space for storing ultra-tiny deep neural networks (DNNs). Besides, making these models responsive to stochastic energy harvesting dynamics during inference requires a balance between inference accuracy, latency, and energy overhead. Recent works on compression mostly focus on time and memory, but of- ten ignore energy dynamics or significantly reduce the accuracy of pre-trained DNNs. Existing energy-adaptive inference works modify the architecture of pre-trained models and have significant memory overhead. Thus, energy-adaptive and accurate inference of pre-trained DNNs on batteryless devices with extreme memory constraints is more challenging than traditional microcontrollers. We combat these issues by proposing FreeML, a framework to optimize pre-trained DNN models for memory-efficient and energy- adaptive inference on batteryless systems. FreeML comprises (1) a novel compression technique to reduce the model footprint and runtime memory requirements simultaneously, making them exe- cutable on extremely memory-constrained batteryless platforms; and (2) the first early exit mechanism that uses a single exit branch for all exit points to terminate inference at any time, making models energy-adaptive with minimal memory overhead. Our experiments showed that FreeML reduces the model sizes by up to 95×, supports adaptive inference with a 2.03 − 19.65× less memory overhead, and provides significant time and energy benefits with only a negligible accuracy drop compared to the state-of-the-art.



Work Experience

BASH LAB
Worcester, MA, USA
Graduate Research Assistant
May 2022 – Present
Meta Reality Labs
Redmond, WA, USA
Part-Time Student Researcher
August 2024 – Present
Meta Reality Labs
Redmond, WA, USA
Research Scientist Intern
May 2024 – August 2024
Dept. of ECE, WPI
Worcester, MA, USA
Graduate Teaching Assistant
August 2022 – May 2023
Advanced Chemical Industries Limited
Dhaka, Bangladesh
Software Engineer, AI & IoT
February 2021 – August 2022

Awards

08-2022

1st Runner up at Robi Datathon 2.0.

10-2020

5th at IEEE Video and Image Processing Cup.

04-2020

4th at IEEE Signal Processing Cup.

06-2019

Winner of Bangladesh Section, IEEE YESIST12 Innovation Challenge 2019.


Contact

Call:

+1 508 535 2131