Does In-IDE Calibration of Large Language Models Work at Scale?
Empirical investigation of calibration techniques for in-IDE LLM suggestions at scale.
Empirical investigation of calibration techniques for in-IDE LLM suggestions at scale.
Preprint on improving developer understanding of LLM-generated unit tests.