Preparation
create #baseline for perform_experiments.py (=coder_prompt)
What is the composition of the result of A?
The baseline results are loaded from a JSON file named final_info.json located in the #run_0 directory.
2D Diffusion
The baseline input data for the 2D Diffusion experiment is not based on text datasets like enwik8, Shakespeare, or text8 (for #NanoGPT ), which are commonly used for language modeling
Instead, the 2D Diffusion baseline uses synthetic 2D datasets that are designed to evaluate the performance of a diffusion model on simple geometric shapes
circle
This dataset contains data points arranged in a circular pattern
It is used to evaluate how well the model can learn and reproduce circular shapes through the diffusion process
dino
This dataset contains points arranged in a pattern that resembles a dinosaur
It is often used as a visually complex shape to test the model's ability to handle non-linear and intricate patterns
line
This dataset contains points arranged in a linear pattern, such as a straight line
It is used to assess the model's performance on simple linear data distributions
moons
This dataset contains points arranged in two interleaving half-moon shapes
It is a popular synthetic dataset used to evaluate models on binary classification tasks with non-linear decision boundaries