Scale Events
+00:00 GMT
Sign in or Join the community to continue

Introduction to Text and Code Embeddings in the OpenAI API

Posted Oct 18, 2022 | Views 721
# TransformX 2022
# Breakout Session
# GPT-3
# Large Language Models (LLMs)
Share
speaker
avatar
Arvind Neelakantan
Research Lead & Manager @ OpenAI

Arvind Neelakantan is a Research Lead and Manager at OpenAI working on deep learning research for real-world applications. He got his PhD from UMass Amherst where he was also a Google PhD Fellow. His work has received best paper awards at NeurIPS and at Automated Knowledge Base Construction workshop.

+ Read More
SUMMARY

Text embeddings are useful features in many applications including semantic search, predicting code completion, natural language, topic modeling, classification, and computing text similarity. Arvind Neelakantan, Research Lead and Manager at OpenAI, introduces the concept of embeddings, a new terminus in the OpenAI API.

When OpenAI originally introduced the API two years ago, it was based on the GPT-3 model, which was useful for many tasks. But, as Neelakantan explains, GPT-3 is not explicitly optimized to produce a single vector or embedding of the input. This ability, to have a condensed representation of the input, would be helpful for programmers and others to use as features for downstream applications, the OpenAI team determined. They set about building an unsupervised model that is good at getting this kind of single embedding, and created a contrastive pre-training model, which Neelakantan will describe. He covers use cases for embeddings, and how the API is used in the real world, including at JetBrains Research for astronomical research and at FineTune Learning, which builds education systems. FineTune is using text embeddings to more accurately find textbook content based on learning objectives.

+ Read More

Watch More

Applied Natural Language Understanding: Topic modeling for text insights - Going from LDA to using advances in language models
Posted Nov 24, 2022 | Views 1.4K
Panel: Race to Better Customer Experience: Value of Data and ML in Fintech
Posted Oct 06, 2021 | Views 2.1K
# TransformX 2021
Accelerate Efforts to Become Data-Driven in an Age of Disruption, Big Data, and AI
Posted Apr 22, 2022 | Views 2.1K
# Tech Talk