Mosaic LLMs (Part 1): Billion-Parameter GPT Training Made Easy
https://www.databricks.com/blog/billion-parameter-gpt-training-made-easy
https://www.databricks.com/sites/default/files/inline-images/billion-parameter-gpt-training-img-3.png?v=1703156859
Table 2