neural network model architecture exploration for reduced SearcH in function leaRnINK (SHRINK)

This repository presents a structured approach to determining optimal neural network configurations for approximating a function, although y=tanh(2x) is demonstrated, in can apply to any given dataset. The project focuses on circumventing reliance on heuristic choices like arbitrary model scaling and instead applies a quantifiable methodology informed by function complexity. By systematically evaluating model architectures, this repository contributes to establishing a correlation between function characteristics and the neural network size required to meet specific approximation criteria.

The project leverages evidence from grid searches across varying neural network depths and widths, comparing architectures based on loss performance and computational efficiency. Through rigorous experimentation, feature-based boundaries to guide the selection of network configurations that achieve desired approximation accuracy with minimized parameter counts have been defined (see parameter-calculation.py ).

num_params_required, func_complexity_measures = calculate_parameter_count_direct(
    x_data, y_data, 1e-7
) # 1e-7 is the desired error

Modeling y = tanh(2x)

Satisficing equation

Results

Sample Length of 1e2 (Total Data Train + Test = 1e3)

Time Taken: 1179.01 seconds approx. 0.014 days Top 10 architectures with the lowest losses:

N = 7, M = 40, Loss = 1.007e-04, Score = 70
N = 5, M = 45, Loss = 1.042e-04, Score = 60
N = 3, M = 47, Loss = 1.702e-04, Score = 36
N = 3, M = 49, Loss = 1.735e-04, Score = 39
N = 6, M = 45, Loss = 1.896e-04, Score = 72
N = 12, M = 47, Loss = 1.936e-04, Score = 144
N = 5, M = 44, Loss = 2.085e-04, Score = 55
N = 3, M = 28, Loss = 2.236e-04, Score = 21
N = 5, M = 41, Loss = 2.244e-04, Score = 55
N = 6, M = 26, Loss = 2.455e-04, Score = 42

Sample Length of 1e5(Total Data Train + Test = 1e6)

Time Taken: 105705.32 seconds approx. 1.223 days Top 10 architectures with the lowest losses:

N = 6, M = 44, Loss = 5.380e-07, Score = 66
N = 16, M = 37, Loss = 6.351e-07, Score = 160
N = 12, M = 49, Loss = 1.046e-06, Score = 156
N = 8, M = 45, Loss = 1.096e-06, Score = 96
N = 8, M = 49, Loss = 1.172e-06, Score = 104
N = 10, M = 43, Loss = 1.216e-06, Score = 110
N = 19, M = 47, Loss = 1.262e-06, Score = 228
N = 8, M = 24, Loss = 1.290e-06, Score = 48
N = 9, M = 16, Loss = 1.315e-06, Score = 36
N = 7, M = 37, Loss = 1.340e-06, Score = 70

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

neural network model architecture exploration for reduced SearcH in function leaRnINK (SHRINK)

Modeling y = tanh(2x)

Satisficing equation

Results

Sample Length of 1e2 (Total Data Train + Test = 1e3)

Sample Length of 1e5(Total Data Train + Test = 1e6)

Files

README.md

Latest commit

History

README.md

File metadata and controls

neural network model architecture exploration for reduced SearcH in function leaRnINK (SHRINK)

Modeling y = tanh(2x)

Satisficing equation

Results

Sample Length of 1e2 (Total Data Train + Test = 1e3)

Sample Length of 1e5(Total Data Train + Test = 1e6)