
Llama.cpp server destroys <|eot_id|> token even midway through prompt! #6793


Closed

araleza opened this issue Apr 20, 2024 · 1 comment

araleza commented Apr 20, 2024

In ./server, it is not possible to use Continuation mode correctly with Llama 3 70B, because the correct prompt template cannot be entered: the <|eot_id|> token is tokenized into zero tokens, even when it occurs midway through the prompt:

[Screenshot: the server web UI showing the typed prompt and the cached/predicted token counts]

(In the above image, I hit Start and looked at the number of tokens cached minus the number of tokens predicted: 402 - 400 = 2. That difference is the number of tokens in the prompt I typed. The result shown is 2, where it should be 3, because the <|eot_id|> token contributes nothing. I deleted the generated tokens before taking this screenshot, to show what I originally typed.)
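For anyone who wants to repeat this count check outside the web UI, here is a minimal sketch (not part of the original report) that queries the server's /tokenize endpoint. It assumes the server is listening on localhost:8080; whether this endpoint parses special tokens may differ from the completion path, so treat it only as a way to inspect token counts.

```python
# Sketch: inspect how the running ./server tokenizes a prompt containing
# <|eot_id|>. Assumes the server is reachable at localhost:8080.
import json
import urllib.request

def tokenize(text: str, url: str = "http://localhost:8080/tokenize") -> list[int]:
    """POST text to the server's /tokenize endpoint and return the token ids."""
    body = json.dumps({"content": text}).encode("utf-8")
    req = urllib.request.Request(url, data=body,
                                 headers={"Content-Type": "application/json"})
    with urllib.request.urlopen(req) as resp:
        return json.load(resp)["tokens"]

# Compare how many tokens the prompt uses with and without <|eot_id|> in the
# middle; if the special token is being destroyed, the counts barely differ.
print(len(tokenize("Hello")))
print(len(tokenize("Hello<|eot_id|>Hello")))
```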

This token appears multiple times in the prompt template, which looks like this:

<|begin_of_text|><|start_header_id|>system<|end_header_id|>

[system prompt goes here]<|eot_id|><|start_header_id|>user<|end_header_id|>

[user prompt goes here]<|eot_id|><|start_header_id|>assistant<|end_header_id|>

[ai response will go here]

Not adhering to the prompt template usually degrades the quality of the LLM's output.
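For illustration only (this code is not part of the report), the template above would be assembled into a single prompt string roughly like this; if <|eot_id|> is dropped during tokenization, the model never sees these turn boundaries:

```python
# Sketch: assemble the Llama 3 chat template shown above into one prompt
# string for the server's Continuation/completion mode.
def build_llama3_prompt(system_prompt: str, user_prompt: str) -> str:
    return (
        "<|begin_of_text|><|start_header_id|>system<|end_header_id|>\n\n"
        f"{system_prompt}<|eot_id|><|start_header_id|>user<|end_header_id|>\n\n"
        f"{user_prompt}<|eot_id|><|start_header_id|>assistant<|end_header_id|>\n\n"
    )

print(build_llama3_prompt("You are a helpful assistant.", "Hello!"))
```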

phymbert (Collaborator) commented

Please wait for:

If your issue persists once you have re-converted the HF model and are running the latest server code with those PRs merged, please ping me and I will reopen.
