Learning to summarize with human feedback

By hi@aiweekly.co.in

We’ve applied reinforcement learning from human feedback to train language models that are better at summarization.