Robuta

Sponsor of the Day: Jerkmate
https://www.clug.de/2016/08/ [C]hemnitzer [L]inux [U]ser [G]roup » 2016 » August u ser2016 augustclroup https://openreview.net/forum?id=roNSXZpUDN {$\tau$}-bench: A Benchmark for \underline{T}ool-\underline{A}gent-\underline{U}ser Interaction in... Existing benchmarks for language agents do not set them up to interact with human users or follow domain-specific rules, both of which are vital to safe and... u sertaubenchunderlineool https://www.clug.de/category/stammtisch/ [C]hemnitzer [L]inux [U]ser [G]roup » Stammtisch u serclgroup