Latest Post

Benchmarking a Moving Target, or let’s run a hypo through 7 AIs and see what happens

September 5, 2025

Debbie Ginsberg, Guest Blogger

Benchmarking should be simple, right? Come up with a set of criteria, run some tests, and compare the answers. But how do you benchmark a moving target like generative AI?

Over the past few months, I’ve tested a sample legal question in various commercial LLMs (like ChatGPT and Google Gemini) and RAGs…

Blogs

AI Law Librarians

Firm/Org

Capital University Law School