OpenAI’s InstructGPT: Aligning Language Models With Human Intent

Name: OpenAI%E2%80%99s%20InstructGPT:%20Aligning%20Language%20Models%20With%20Human%20Intent
Uploaded: 2022-09-09T15:06:09.019Z

Posted Sep 09, 2022 | Views 45K

# Large Language Models (LLMs)

# Natural Language Processing (NLP)

Long Ouyang

Research Scientist @ OpenAI

Long Ouyang is a research scientist at OpenAI, where he works on human-in-the-loop machine learning. He has helped build variants of GPT, such as InstructGPT and WebGPT. Previously, he obtained his PhD in cognitive psychology from Stanford.

He's the lead author of, Training language models to follow instructions with human feedback: https://arxiv.org/abs/2203.02155

+ Read More

Aerin Kim

Engineering Manager @ Scale AI

Aerin Kim is the Engineering Manager at Scale AI, turning raw data into high-quality training data using machine learning. She is fascinated by the science aspect of training data, the one and only input of deep learning that determines the performance of the model. Prior to Scale, Aerin was a Senior Research Software Engineer at Microsoft where she worked on question answering, semantic parsing and training data generation for AI applications. Before joining Microsoft, Aerin received her Masters degree in Operations Research from the Fu Foundation School of Engineering and Applied Science at Columbia University.

+ Read More

SUMMARY

Making language models bigger does not inherently make them better at following a user's intent. For example, large language models can generate outputs that are untruthful, toxic, or simply not helpful to the user. In other words, these models are not aligned with their users. In this presentation, Long Ouyang, Research Scientist at OpenAI, and Aerin Kim, Engineering Manager at Scale AI will explore an avenue for aligning language models with user intent on a wide range of tasks by fine-tuning with human feedback.

Join Long, OpenAI Research Scientist, and Aerin, Scale AI Engineering Manager, for a technical presentation followed by a discussion around Long's work with InstructGPT, and audience Q&A.

+ Read More

Watch More

What's Next for AI Systems & Language Models With Ilya Sutskever of OpenAI

Posted Oct 06, 2021 | Views 23.3K

# TransformX 2021

# Fireside Chat

Building Trust in AI: Testing and Evaluating Large Language Models (LLMs)

Posted Sep 28, 2023 | Views 6K

OpenAI's Greg Brockman: The Future of Large Language (LLMs) and Generative Models

Posted Oct 20, 2022 | Views 6K

# TransformX 2022

# Fireside Chat

# Natural Language Processing (NLP)

# Foundation Models

# Large Language Models (LLMs)

# Generative Models

OpenAI’s InstructGPT: Aligning Language Models With Human Intent

Speakers

SUMMARY

Watch More