Building Trust in AI: Testing and Evaluating Large Language Models (LLMs)

Uploaded: 2023-09-28T06:05:15.048Z

Posted Sep 28, 2023 | Views 6K

# Enterprise

# GenAI

# LLMs

# AI Safety

# Webinar

Daniel Berrios

Chief of Staff @ Scale AI

Daniel Berrios is the Chief of Staff at Scale, where he focuses on strategic projects on behalf of the CEO, including the launch of Scale’s large language model test & evaluation platform. Before that, he was Co-Founder and Chief Operating Officer of Helia, a machine learning company focused on physical security applications, which Scale acquired in 2020. Earlier in his career, he advised on IPOs, M&A, and corporate strategy at Goldman Sachs. He was born and raised in California, where he developed his deep passion for technology, and holds a degree in Economics from Stanford University.

Dylan Slack

Machine Learning Research Engineer @ Scale AI

Dylan Slack is a Machine Learning Research Engineer at Scale AI. He earned his Ph.D. at UC Irvine, where he was advised by Sameer Singh and Hima Lakkaraju and was associated with UCI NLP, CREATE, and the HPI Research Center. His research focuses on developing techniques that help researchers and practitioners build more robust, reliable, and trustworthy machine learning systems. His work has been published at venues such as NeurIPS, Nature, EMNLP, and ACL, and has been cited more than 1,000 times. He has previously worked at Google AI and AWS.

SUMMARY

Understanding the capabilities, risks, and vulnerabilities of large language models is critical to ensuring the safety of these models. Join Scale as we discuss our vision for an effective and comprehensive test and evaluation (“T&E”) regime for LLMs, how that regime leverages human experts, and how we aim to serve this need with our new Scale LLM Test & Evaluation offering.

Watch More

OpenAI's Greg Brockman: The Future of Large Language (LLMs) and Generative Models

Posted Oct 20, 2022 | Views 6K

# GenAI

# LLMs

# Fireside Chat

Cohere: Unlocking the Potential of Large Language Models (LLMs)

Posted Oct 21, 2022 | Views 2.2K

# GenAI

# LLMs

# Fireside Chat

What's Next for AI Systems & Language Models With Ilya Sutskever of OpenAI

Posted Oct 06, 2021 | Views 23.3K

# Enterprise

# GenAI

# LLMs

# AI Strategy

# Fireside Chat
