Ask HN: How do LLMs work really?

thrw0451 pts0 comments

Sorry, I know this has been discussed to death, sort of, but I feel like it's not always explicitly stated in some discussions, and people have different views?

Is it really so simple that all information in an LLM comes from the probability of each token based on the prompt? So for any prompt, there is a probability distribution over how to continue that prompt to generate text?

Does all structure of information come from probabilities of tokens (so all structure and information processing is a side effect of token probabilities)? Or is there other stuff going on? I know reasoning models have extra stuff, but let's put that aside for now.
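To make concrete what I mean by "a probability distribution over continuations", here is a toy sketch. Everything in it is made up for illustration: the `TOY_PROBS` table, the `<eos>` marker, and the function names are hypothetical, and the toy only looks at the last token, whereas I assume a real LLM computes the distribution from the whole prompt with a neural network. The loop is the part I'm asking about: get a distribution, sample one token, append it, repeat.

```python
import random

# Hypothetical toy "model": next-token probabilities that depend only on the
# previous token. This hand-written table stands in for the neural network.
TOY_PROBS = {
    "the": {"cat": 0.6, "dog": 0.4},
    "cat": {"sat": 0.7, "ran": 0.3},
    "dog": {"ran": 1.0},
    "sat": {"<eos>": 1.0},
    "ran": {"<eos>": 1.0},
}


def next_token_distribution(tokens):
    """Return P(next token | tokens so far) as a dict of token -> probability."""
    return TOY_PROBS[tokens[-1]]


def generate(prompt, rng, max_new_tokens=10):
    """Sample one token at a time and feed it back in as context."""
    tokens = list(prompt)
    for _ in range(max_new_tokens):
        dist = next_token_distribution(tokens)
        choices, weights = zip(*dist.items())
        token = rng.choices(choices, weights=weights, k=1)[0]
        if token == "<eos>":  # end-of-sequence marker stops generation
            break
        tokens.append(token)
    return tokens


def sequence_probability(tokens):
    """Probability of a whole continuation: product of per-token probabilities."""
    p = 1.0
    for i in range(1, len(tokens)):
        p *= next_token_distribution(tokens[:i]).get(tokens[i], 0.0)
    return p


if __name__ == "__main__":
    print(generate(["the"], random.Random()))
    print(sequence_probability(["the", "cat", "sat"]))
```

If that picture is right, then the per-token distributions multiply out to a distribution over whole continuations, and my question is whether anything else is going on beyond that.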

