AI-SRE
SRE
監視・ログ
マルチエージェントシステム
ログ分析
LLM Assisted Anomaly Detection Service for Site Reliability Engineers: Enhancing Cloud Infrastructure Resilience
https://arxiv.org/pdf/2501.16744
AI-Assisted Incident Management in SRE: The Role of LLMs and Anomaly Detection
https://al-kindipublisher.com/index.php/jcsts/article/view/10054
“LLM for SRE“の世界探索
https://blog.yuuk.io/entry/2024/the-world-of-llm4sre
A Survey of AIOps for Failure Management in the Era of Large Language Models
https://arxiv.org/abs/2406.11213?utm_source=chatgpt.com
Awesome LLM AIOps
https://github.com/Jun-jie-Huang/awesome-LLM-AIOps
A Comprehensive Survey on Root Cause Analysis in (Micro) Services: Methodologies, Challenges, and Trends
https://arxiv.org/abs/2408.00803