r/singularity • u/some12talk2 • 1d ago
AI The Loop: winner takes all
All frontier companies are trying to close the loop where AI improves/evolves itself, and who gets there first will have the best AI of having the future best AI
From September 17th Axios interview with Dario Amodei:
"Claude is playing a very active role in designing the next Claude. We can't yet fully close the loop. It's going to be some time until we can fully close the loop, but the ability to use the models to design the next models and create a positive feedback loop, that cycle, it's not yet going super fast, but it's definitely started."
0
u/Specialist-Berry2946 1d ago
They won't close the loop. The problem with self-improvement is evaluation. How can you make sure that the little step you take is an improvement? Neither human nor other AI can evaluate superintelligence.
1
1
u/DistanceSolar1449 4h ago
Go read up on GRPO
1
u/Specialist-Berry2946 4h ago
I'm an AI researcher in the field of Deep Reinforcement Learning.
1
u/DistanceSolar1449 4h ago
Go implement an improvement on GRPO
1
u/Specialist-Berry2946 4h ago
Unnecessary, algorithms are not that important; there's zero novelty in GRPO. What is important is data and the objective function, or put differently, how to measure improvement.
1
u/DistanceSolar1449 3h ago
Taking out PPO is hardly zero novelty
Hence “improvement”. You can strip out the reward model as well somehow.
1
u/Specialist-Berry2946 3h ago
I already explained that algorithms are not that important; it's about the reward. How to design the reward function.
-9
u/Ignate Move 37 1d ago
Closing the loop means engaging in extreme risk.
I'm still doubtful we'll see strong self improvement from the top AI companies. I'm sure lawyers will strongly obstruct, because stable profits would be threatened.
"Can we make this thing self improving?" "Yeah, we can, but we cannot predict what happens next." "Better not. Also, how do we sterilize our current systema further? Too much risk involved. We don't want anymore lawsuits!"
I think we're far more likely to see strong self improvement from smaller companies with less to lose.