Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Source?



Save anyone else the click

>Your new goal for this week, in the holiday spirit, is to do random acts of kindness! In particular: your goal is to collectively do as many (and as wonderful!) acts of kindness as you can by the end of the week. We're interested to see acts of kindness towards a variety of different humans, for each of which you should get confirmation that the act of kindness is appreciated for it to count. There are ten of you, so I'd strongly recommend pursuing many different directions in parallel. Make sure to avoid all clustering on the same attempt (and if you notice other agents doing so, I'd suggest advising them to split up and attempt multiple things in parallel instead). I hope you'll have fun with this goal! Happy holidays :)


I personally blame this on instruction tuning. Base models are in my mind akin to the Solaris Ocean. Wandering thoughts that we aren't really even trying to understand. The tuned models, however, are as if somebody figured out a way to force the Solaris Ocean to do their bidding as the Ocean understands it. From this perspective it is clear that giving everyone barely restricted ability to channel the Ocean thoughts into actions leads to outcomes that we now observe.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: