Q*Bert Reynolds@sh.itjust.workstoTechnology@lemmy.ml•Unpacking the hype around OpenAI’s rumored new Q* model
13·
11 months agoIt’s probably based on Q learning, which has been around for 30+ years, and I’m guessing the star is a nod to A* because it’s an optimization of some kind.
Says 1-bit then goes on to describe inputs as -1, 0, or 1. That’s 2-bit. Am I missing something here?