Picture Generated by Bing Image Creator

Reinforcement Learning with Human Feedback

A gentle introduction

Valentina Alto
6 min readDec 10, 2023

--

Large Language Models (LLMs) have demonstrated outstanding capabilities in their conversational interactions with human.

In fact, the way we typically consume LLMs is via AI Assistants, such as ChatGPT. The reason why ChatGPT and similar AI assistants were so disruptive (ChatGPT reached 1M users in just 5 days!) is that they are…

--

--

Valentina Alto

Data&AI Specialist at @Microsoft | MSc in Data Science | AI, Machine Learning and Running enthusiast