数値要約の限界
from データの相関, データの分布と数値要約
数値要約の限界
相関係数やその他の統計量についての面白い話題
Same Stats, Different Graphs
https://www.autodeskresearch.com/publications/samestats
元の論文
Justin Matejka, George Fitzmaurice, “Same Stats, Different Graphs: Generating Datasets with Varied Appearance and Identical Statistics through Simulated Annealing”, Proceedings of the 2017 CHI Conference on Human Factors in Computing Systems, pp. 1290–1294, 2017.
https://www.research.autodesk.com/app/uploads/2023/03/same-stats-different-graphs.pdf_rec2hRjLLGgM7Cn2T.pdf
これらはすべて、平均・標準偏差・相関係数が同じ!
https://gyazo.com/df14a4ea10aec9aef505e8c5340ac1a3
図はhttps://www.autodeskresearch.com/publications/samestats より引用(2023/12/6参照)
下図のA~Fはいずれも異なる分布をしているが、箱ひげ図にすると全く同じ
https://gyazo.com/0a59b7ac6da7f59ad47b6c189077c900
図は次の論文より引用: Justin Matejka, George Fitzmaurice, “Same Stats, Different Graphs: Generating Datasets with Varied Appearance and Identical Statistics through Simulated Annealing”, Proceedings of the 2017 CHI Conference on Human Factors in Computing Systems, pp. 1290–1294, 2017.