Decision-making systems powered by deep neural networks have transformed artificial intelligence applications across diverse domains. The choice of activation functions fundamentally influences network capacity to learn optimal decision policies, handle uncertainty, and generalize across contexts. This paper analyzes how activation functions impact decision-making processes in neural architectures, examining six fundamental functions: Linear, Sigmoid, Hyperbolic Tangent (TanH), Rectified Linear Unit (ReLU), Parametric ReLU (PReLU), and Exponential Linear Unit (ELU). Through mathematical analysis and empirical validation across decisionmaking benchmarks, we demonstrate that modern activation functions like ReLU and its variants provide superior performance by enabling better gradient flow, faster convergence, and more stable policy learning. Our findings reveal that activation function selection must balance computational efficiency, gradient preservation, and domain-specific requirements, with no single function being universally optimal. We provide quantitative metrics and practical guidelines for architecture design in decision-making systems

Activation Functions for Optimizing Decision-Making in Neural Networks: Mathematical Analysis and Empirical Validation / Ferrara, M., Ciccia, C.. - In: WSEAS TRANSACTIONS ON CIRCUITS AND SYSTEMS. - ISSN 2224-266X. - 24:(2025), pp. 298-309. [10.37394/23201.2025.24.31]

Activation Functions for Optimizing Decision-Making in Neural Networks: Mathematical Analysis and Empirical Validation

Ferrara M.
Conceptualization
;
2025-01-01

Abstract

Decision-making systems powered by deep neural networks have transformed artificial intelligence applications across diverse domains. The choice of activation functions fundamentally influences network capacity to learn optimal decision policies, handle uncertainty, and generalize across contexts. This paper analyzes how activation functions impact decision-making processes in neural architectures, examining six fundamental functions: Linear, Sigmoid, Hyperbolic Tangent (TanH), Rectified Linear Unit (ReLU), Parametric ReLU (PReLU), and Exponential Linear Unit (ELU). Through mathematical analysis and empirical validation across decisionmaking benchmarks, we demonstrate that modern activation functions like ReLU and its variants provide superior performance by enabling better gradient flow, faster convergence, and more stable policy learning. Our findings reveal that activation function selection must balance computational efficiency, gradient preservation, and domain-specific requirements, with no single function being universally optimal. We provide quantitative metrics and practical guidelines for architecture design in decision-making systems
2025
Activation Functions, Deep Neural Networks, Decision-Making Systems, Reinforcement Learning, Markov Decision Processes, Gradient Flow, Temporal Credit Assignment
File in questo prodotto:
File Dimensione Formato  
Ferrara-Ciccia_2025_WSEAS_Activation Functions_editor.pdf

accesso aperto

Descrizione: Articolo
Tipologia: Versione Editoriale (PDF)
Licenza: Creative commons
Dimensione 750.42 kB
Formato Adobe PDF
750.42 kB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.12318/165208
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 0
  • ???jsp.display-item.citation.isi??? ND
social impact