Top Keywords for Super Data Science: ML & AI Podcast with Jon Krohn

706: Large Language Model Leaderboards and Benchmarks

Podcast: Super Data Science: ML & AI Podcast with Jon Krohn
Published On: Fri Aug 18 2023
Description: In this episode, Caterina Constantinescu dives deep into Large Language Models (LLMs), spotlighting top leaderboards, evaluation benchmarks, and real-world user perceptions. Plus, discover the challenges of dataset contamination and the intricacies of platforms like HELM and Chatbot Arena.
Additional materials: www.superdatascience.com/706
Interested in sponsoring a SuperDataScience Podcast episode? Visit JonKrohn.com/podcast for sponsorship information.
