Decision-making systems powered by deep neural networks have transformed artificial intelligence applications across diverse domains. The choice of activation functions fundamentally influences network capacity to learn optimal decision policies, handle uncertainty, and generalize across contexts. This paper analyzes how activation functions impact decision-making processes in neural architectures, examining six fundamental functions: Linear, Sigmoid, Hyperbolic Tangent (TanH), Rectified Linear Unit (ReLU), Parametric ReLU (PReLU), and Exponential Linear Unit (ELU). Through mathematical analysis and empirical validation across decisionmaking benchmarks, we demonstrate that modern activation functions like ReLU and its variants provide superior performance by enabling better gradient flow, faster convergence, and more stable policy learning. Our findings reveal that activation function selection must balance computational efficiency, gradient preservation, and domain-specific requirements, with no single function being universally optimal. We provide quantitative metrics and practical guidelines for architecture design in decision-making systems
Activation Functions for Optimizing Decision-Making in Neural Networks: Mathematical Analysis and Empirical Validation / Ferrara, M., Ciccia, C.. - In: WSEAS TRANSACTIONS ON CIRCUITS AND SYSTEMS. - ISSN 2224-266X. - 24:(2025), pp. 298-309. [10.37394/23201.2025.24.31]
Activation Functions for Optimizing Decision-Making in Neural Networks: Mathematical Analysis and Empirical Validation
Ferrara M.
Conceptualization
;
2025-01-01
Abstract
Decision-making systems powered by deep neural networks have transformed artificial intelligence applications across diverse domains. The choice of activation functions fundamentally influences network capacity to learn optimal decision policies, handle uncertainty, and generalize across contexts. This paper analyzes how activation functions impact decision-making processes in neural architectures, examining six fundamental functions: Linear, Sigmoid, Hyperbolic Tangent (TanH), Rectified Linear Unit (ReLU), Parametric ReLU (PReLU), and Exponential Linear Unit (ELU). Through mathematical analysis and empirical validation across decisionmaking benchmarks, we demonstrate that modern activation functions like ReLU and its variants provide superior performance by enabling better gradient flow, faster convergence, and more stable policy learning. Our findings reveal that activation function selection must balance computational efficiency, gradient preservation, and domain-specific requirements, with no single function being universally optimal. We provide quantitative metrics and practical guidelines for architecture design in decision-making systems| File | Dimensione | Formato | |
|---|---|---|---|
|
Ferrara-Ciccia_2025_WSEAS_Activation Functions_editor.pdf
accesso aperto
Descrizione: Articolo
Tipologia:
Versione Editoriale (PDF)
Licenza:
Creative commons
Dimensione
750.42 kB
Formato
Adobe PDF
|
750.42 kB | Adobe PDF | Visualizza/Apri |
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.


