Ask HN: How do LLMs work really?

thrw0451 pts0 comments

Sorry I know this has been discussed to death sort of, but I feel like it s not always explicitly stated in some discussions and people have different views?Is it really so simple that all information in an LLM comes from the probability of each token based on the prompt? So for any prompt, there is a probability distribution to continuing (after) that prompt to generate text?All structure of information comes from probabilities of tokens (so all structure and information processing is a side effect of token probabilities)? Or is there other stuff going on? I know reasoning models have extra stuff but let s put that aside for now.

information prompt really know comes from

Ask HN: How do LLMs work really?

Related Articles

Amazon, Facebook, FBI have access to a private intelligence-sharing network

SpaceX not the behemoth everyone thought

The Mirror Is Part of the Machine

Elevated error rates on requests to multiple models

Donald Trump and sons to be 'forever' exempt from tax audits