Activation Functions for Optimizing Decision-Making in Neural Networks: Mathematical Analysis and Empirical Validation

IRIS

Decision-making systems powered by deep neural networks have transformed artificial intelligence applications across diverse domains. The choice of activation functions fundamentally influences network capacity to learn optimal decision policies, handle uncertainty, and generalize across contexts. This paper analyzes how activation functions impact decision-making processes in neural architectures, examining six fundamental functions: Linear, Sigmoid, Hyperbolic Tangent (TanH), Rectified Linear Unit (ReLU), Parametric ReLU (PReLU), and Exponential Linear Unit (ELU). Through mathematical analysis and empirical validation across decisionmaking benchmarks, we demonstrate that modern activation functions like ReLU and its variants provide superior performance by enabling better gradient flow, faster convergence, and more stable policy learning. Our findings reveal that activation function selection must balance computational efficiency, gradient preservation, and domain-specific requirements, with no single function being universally optimal. We provide quantitative metrics and practical guidelines for architecture design in decision-making systems

Activation Functions for Optimizing Decision-Making in Neural Networks: Mathematical Analysis and Empirical Validation / Ferrara, M., Ciccia, C.. - In: WSEAS TRANSACTIONS ON CIRCUITS AND SYSTEMS. - ISSN 2224-266X. - 24:(2025), pp. 298-309. [10.37394/23201.2025.24.31]

Activation Functions for Optimizing Decision-Making in Neural Networks: Mathematical Analysis and Empirical Validation

Ferrara M.^{Conceptualization};Ciccia C.^{Membro del Collaboration Group}

2025-01-01

Abstract

Decision-making systems powered by deep neural networks have transformed artificial intelligence applications across diverse domains. The choice of activation functions fundamentally influences network capacity to learn optimal decision policies, handle uncertainty, and generalize across contexts. This paper analyzes how activation functions impact decision-making processes in neural architectures, examining six fundamental functions: Linear, Sigmoid, Hyperbolic Tangent (TanH), Rectified Linear Unit (ReLU), Parametric ReLU (PReLU), and Exponential Linear Unit (ELU). Through mathematical analysis and empirical validation across decisionmaking benchmarks, we demonstrate that modern activation functions like ReLU and its variants provide superior performance by enabling better gradient flow, faster convergence, and more stable policy learning. Our findings reveal that activation function selection must balance computational efficiency, gradient preservation, and domain-specific requirements, with no single function being universally optimal. We provide quantitative metrics and practical guidelines for architecture design in decision-making systems

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno
	
				2025
			
	Parole chiave
	
				Activation Functions, Deep Neural Networks, Decision-Making Systems, Reinforcement Learning, Markov Decision Processes, Gradient Flow, Temporal Credit Assignment
			
	Appare nelle tipologie:
	
				1.1 Articolo in rivista

File in questo prodotto:

File	Dimensione	Formato
Ferrara-Ciccia_2025_WSEAS_Activation Functions_editor.pdf accesso aperto Descrizione: Articolo Tipologia: Versione Editoriale (PDF) Licenza: Creative commons Dimensione 750.42 kB Formato Adobe PDF Visualizza/Apri	750.42 kB	Adobe PDF	Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.12318/165208

Citazioni

ND

0

ND

social impact