LLM from scratch, part 28 – training a base model from scratch on an RTX 3090
Giles’ blog About Contact Archives Categories Blogroll December 2025 (1) November 2025 (3) October 2025 (9) September 2025 (3) August 2025 (5) July 2025 (1) June 2025 (2) May 2025 (3) April 2025 (2) March 2025 (7) February 2025 (10) January 2025 (6) December 2024 (7) September 2024 (1) August 2024 (2) July 2024 (2) May 2024 (2) April 2024 (2) February 2024 (2) April 2023 (1) March 2023 (2) September 2022 (1) February 2022 (1) November 2021 (1) March 2021 (1) February 2021 (2) August 2019 (1) November 2018 (1) May 2017 (1) December 2016 (1) April 2016 (1) August 2015 (1) December 2014 (1) August 2014 (1) March 2014 (1) …