TinyLoRA – Learning to Reason in 13 Parameters
66 points by sorenjan 5 days ago | 6 comments
measurablefunc 2 hours ago
With four parameters I can fit an elephant, and with five I can make him wiggle his trunk, so there is still room for improvement.
esafak 60 minutes ago
Except learning to reason is a far cry from curve fitting. Our brains have more than five parameters.
voxelghost 9 minutes ago
After a quick skim, my understanding is that this is more like a very compressed diff vector applied to a multi-billion-parameter model, so that the model can be 'retrained' to reason (score) better on a specific topic; math, for example, was used in the paper.
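To make the "compressed diff" idea concrete, here is a toy sketch. It is not the paper's method. It assumes the diff is a tiny trainable vector expanded through a fixed random projection and added to frozen weights. The names `make_projection` and `apply_diff`, the 4x6 weight shape, and the Gaussian projection are all made up for illustration.

```python
import random

def make_projection(n_weights, n_trainable, seed=0):
    # Hypothetical fixed (never trained) random map from the tiny vector
    # to one delta value per frozen weight.
    rng = random.Random(seed)
    scale = 1.0 / n_trainable ** 0.5
    return [[rng.gauss(0.0, scale) for _ in range(n_trainable)]
            for _ in range(n_weights)]

def apply_diff(frozen, projection, v):
    # Effective weights = frozen weights + (projection @ v), reshaped
    # back to the frozen matrix's shape. Only v would be trained.
    rows, cols = len(frozen), len(frozen[0])
    delta = [sum(p * x for p, x in zip(row, v)) for row in projection]
    return [[frozen[r][c] + delta[r * cols + c] for c in range(cols)]
            for r in range(rows)]

# Stand-in for one frozen weight matrix of a large model (toy size).
rows, cols, n_trainable = 4, 6, 13
rng = random.Random(1)
frozen = [[rng.gauss(0.0, 1.0) for _ in range(cols)] for _ in range(rows)]
projection = make_projection(rows * cols, n_trainable)

v = [0.0] * n_trainable          # the only "trainable" parameters
unchanged = apply_diff(frozen, projection, v)

v[0] = 0.5                       # pretend an optimizer nudged one entry
patched = apply_diff(frozen, projection, v)
```

With `v` all zeros the model is untouched. Changing any of the 13 numbers shifts every weight a little, which is why such a small vector can still steer a large model.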
[0]: cartesien.io or Salesforce's WebscaleRL