Date of Award

8-1-2025

Degree Name

Doctor of Philosophy

Department

Electrical and Computer Engineering

First Advisor

Tragoudas, Spyros

Abstract

The widespread adoption of deep learning has led to a surge in demand for the efficient deployment of Deep Neural Networks (DNNs) on embedded and resource-constrained systems. These platforms are increasingly tasked with performing complex inference workloads in real time, yet they must do so under strict limitations on energy consumption, computation, and memory resources. As DNN models grow in complexity to support advanced applications such as time-series forecasting and image classification, ensuring their efficient execution without compromising performancebecomes a key design challenge. This calls for tailored algorithmic and architectural solutions that can meet application-specific constraints while maximizing the benefits of modern hardware accelerators.This dissertation proposes a set of algorithmic-based approaches aimed at enhancing the efficiency and adaptability of deep learning models for embedded systems. The research centers on three primary contributions: (i) a hybrid Long Short-Term Memory (LSTM)-Transformer architecture optimized for multi-step residential power load forecasting, integrating sequence modeling and attention mechanisms for improved accuracy under training time constraints, (ii) a lightweight hierarchical deep neural network enhancement that augments baseline classifiers through cascading binaryConvolutional Neural Networks (CNNs) and Vision Transformers, achieving higher classification accuracy and reduced inference time, and (iii) an energy-aware scheduling framework for DNN inference, featuring dynamic batch size selection, GPU frequency adjustment, and concurrent task mapping across multi-GPU systems to minimize energy use without violating real-time deadlines.In summary, this dissertation presents a comprehensive approach to designing and managing deep learning workflows on embedded platforms, emphasizing performance-efficiency trade-offs. The proposed methods demonstrated significantimprovements in both predictive accuracy and resource utilization, compared to existing solutions, offering practical pathways for deploying DNNs in sustainable, computing environments.

Share

COinS
 

Access

This dissertation is only available for download to the SIUC community. Current SIUC affiliates may also access this paper off campus by searching Dissertations & Theses @ Southern Illinois University Carbondale from ProQuest. Others should contact the interlibrary loan department of your local library or contact ProQuest's Dissertation Express service.