Show HN: Kitchen Rush, Overcooked inspired LLM tool calling benchmark (opens in new tab)
Kitchen Rush: a benchmark for accurate AND fast native tool calling - bassimeledath/kitchen-rush
Read the original articleKitchen Rush: a benchmark for accurate AND fast native tool calling - bassimeledath/kitchen-rush
Read the original article