Hacker News
at2005's comments
at2005 6 hours ago | on: Tree Search Distillation for Language Models Using...
I didn't compare with the harness (focused on distillation) but the original ToT paper has a section on it:
https://arxiv.org/abs/2305.10601
at2005 1 day ago | on: Tree Search Distillation for Language Models Using...
Ah, I meant that MCTS uses more inference-time compute (over GRPO) to produce a training sample
at2005 on Feb 23, 2021 | on: A HL Programming Language for Quantum Computers
Btw the whole motivation for this was algorithms like Grover's, which need "oracles" to be specified. You can only imagine trying to code adders and greater-than circuits with QASM...