https://gwern.net/gpt-2-preference-learning
Experiments with OpenAI’s ‘preference learning’ approach, which trains a NN to predict global quality of datapoints, and then uses reinforcement learning...
preference learningmusic generationgptgwernnet
https://arxiv.org/abs/2407.12164
Abstract page for arXiv paper 2407.12164: Subject-driven Text-to-Image Generation via Preference-based Reinforcement Learning
text to imagesubjectdrivengenerationvia
https://www.surveymonkey.com/r/learningpreference
Web survey powered by SurveyMonkey.com. Create your own online survey now with SurveyMonkey's expert certified FREE templates.
learningpreferencestyleindicatorsurvey