Code Release for "Broken Neural Scaling Laws" (BNSL) paper (arxiv.org/abs/2210.14891)

Read Appendix A.6 of arXiv version of this paper for more details on how to use this code.

To reproduce the Fitting and Extrapolation of BNSL on 4 Digit Addition from Figure 5 Left, run

python fit_bnsl_and_extrapolate__4_digit_addition__dataset_size_x-axis.py

To reproduce the Fitting and Extrapolation of BNSL on a noiseless simulation of the scaling behavior of 4 Digit Addition from Figure 5 Right, run

python fit_bnsl_and_extrapolate__4_digit_addition__dataset_size_x-axis__noiseless_simulation.py

To reproduce the Decomposition of BNSL into Power Law Segments from Figure 1, run

python make_figure_1__decomposition_of_bnsl_into_power_law_segments.py

Note:

🚨🚨🚨

When you fit a BNSL to your own scaling data, you may need to adjust the grid search range and resolution to get a good fit.

🚨🚨🚨

Here is some bibtex to use for citation:

@inproceedings{
caballero2023broken,
title={Broken Neural Scaling Laws},
author={Ethan Caballero and Kshitij Gupta and Irina Rish and David Krueger},
booktitle={The Eleventh International Conference on Learning Representations },
year={2023},
url={https://arxiv.org/abs/2210.14891}
}

Name	Name	Last commit message	Last commit date
Latest commit ethancaballero Update fit_bnsl_and_extrapolate__4_digit_addition__dataset_size_x-axi… Aug 22, 2023 326155c · Aug 22, 2023 History 64 Commits
README.md	README.md	Update README.md	Aug 3, 2023
figure_1.png	figure_1.png	Add files via upload	Oct 30, 2022
fit_bnsl_and_extrapolate__4_digit_addition__dataset_size_x-axis.py	fit_bnsl_and_extrapolate__4_digit_addition__dataset_size_x-axis.py	Update fit_bnsl_and_extrapolate__4_digit_addition__dataset_size_x-axi…	Aug 22, 2023
fit_bnsl_and_extrapolate__4_digit_addition__dataset_size_x-axis__noiseless_simulation.py	fit_bnsl_and_extrapolate__4_digit_addition__dataset_size_x-axis__noiseless_simulation.py	Update fit_bnsl_and_extrapolate__4_digit_addition__dataset_size_x-axi…	Aug 22, 2023
make_figure_1__decomposition_of_bnsl_into_power_law_segments.py	make_figure_1__decomposition_of_bnsl_into_power_law_segments.py	Update make_figure_1__decomposition_of_bnsl_into_power_law_segments.py	Jan 11, 2023
plot__bnsl__fit_and_extrapolate__4_digit_addition__dataset_size_x-axis.png	plot__bnsl__fit_and_extrapolate__4_digit_addition__dataset_size_x-axis.png	Add files via upload	Oct 27, 2022
plot__bnsl__fit_and_extrapolate__4_digit_addition__dataset_size_x-axis__noiseless_simulation.png	plot__bnsl__fit_and_extrapolate__4_digit_addition__dataset_size_x-axis__noiseless_simulation.png	Add files via upload	Nov 2, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Code Release for "Broken Neural Scaling Laws" (BNSL) paper (arxiv.org/abs/2210.14891)

Note:

Here is some bibtex to use for citation:

About

Releases

Packages

Languages

ArianKhorasani/broken_neural_scaling_laws

Folders and files

Latest commit

History

Repository files navigation

Code Release for "Broken Neural Scaling Laws" (BNSL) paper (arxiv.org/abs/2210.14891)

Note:

Here is some bibtex to use for citation:

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages