It is mentioned in the post: > Traditional LLMs output everything they think imm... | Hacker News

Hacker Newsnew | past | comments | ask | show | jobs | submit

		fittom on Aug 27, 2024 \| parent \| context \| favorite \| on: Cerebras Inference: AI at Instant Speed It is mentioned in the post: > Traditional LLMs output everything they think immediately, without stopping to consider the best possible answer. New techniques like scaffolding, on the other hand, function like a thoughtful agent who explores different possible solutions before deciding. This “thinking before speaking” approach provides over 10x performance on demanding tasks like code generation, fundamentally boosting the intelligence of AI models without additional training.

Workaccount2 on Aug 27, 2024 [–]

Is there a tool that provides functionality like this that you can layer on top of cerebras's API, given you are not worried about using 10x-50x more tokens per query.

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact