Souradip Chakraborty @ Neurips 2025
Souradip Chakraborty @ Neurips 2025 @SOURADIPCHAKR18 ·
#DPO instilled a lot of interesting conversations and research discussions around #RLHF #rewardlearning Great to see it being recognized and Congratulations to @rm_rafailov @ericmitchellai @architsharma @chelseabfinn and team. P.S .: #Alignment of #LLMs #GenAI is is crucial
Chelsea Finn Chelsea Finn @chelseabfinn ·
DPO is a runner up for NeurIPS outstanding paper. 🙌 Big congrats especially to the students@rm_rafailov@archit_sharma97@ericmitchellai & the other awardees. If you haven't learned about DPO already, check out the oral & poster 👇 on Thurs afternoonqF
1
1.3K
Fit for Tweet
Fit for Tweet @fit4tweet ·
#rewardlearning dominates organic life. Most societies are structured that way. We watch dogs get a treat after accomplishing a tax. That’s us superior and smart. You get a bonus, leave a tip or like a post. That’s subliminal.
Ralph-Christian Ohr
Ralph-Christian Ohr @ralph_ohr ·
Please give this a 'Like' if you find it insightful... 😜 #socialmedia #rewardlearning #addiction #onlinebehavior #SkinnerBox
1