Default Branch

858f6b73f6 · Add an option to build without CUDA VMM (#7067) · Updated 2024-05-06 20:12:14 +02:00

Branches

94e667a9d8 · gguf-py : add tqdm as a dependency · Updated 2024-05-06 15:08:09 +02:00

2
25

c32d39cefb · Merge branch 'master' into compilade/convert-hf-refactor · Updated 2024-05-06 11:33:38 +02:00

2
19

3f4fed062b · Update run.sh · Updated 2024-05-06 11:08:45 +02:00

2
2

09e3a9ea20 · no final newlines · Updated 2024-05-06 05:50:37 +02:00

3
7

d4abd657ad · Fix flake8 · Updated 2024-05-05 08:23:24 +02:00

22
8

e3cd5527cc · flake.lock: Update · Updated 2024-05-05 02:17:54 +02:00

8
1

c240ae234c · ci : fix arg order · Updated 2024-04-30 10:43:36 +02:00

32
145

b6fafd1747 · llama : remove useless return value for some llama_cache_* functions · Updated 2024-04-29 18:59:43 +02:00

35
8

5ddad95e5c · ci : tmp disable gguf-split · Updated 2024-04-29 17:29:38 +02:00

33
1

80cb3127df · tests : disable test-tokenizer-1-bpe due to slowness · Updated 2024-04-29 14:24:39 +02:00

45
61

8c259f6f3e · ggml : fix MIN / MAX macros · Updated 2024-04-25 13:28:41 +02:00

70
1

f2588b0b70 · convert : fix set_vocab_sentencepiece · Updated 2024-04-24 09:19:38 +02:00

79
1

5dcccb3a7d · convert : fix tokenizer conversion · Updated 2024-04-23 21:11:09 +02:00

81
2

124e4dced2 · Update · Updated 2024-04-22 11:42:32 +02:00

138
2

3750706962 · llama : add llama_token_is_eog() · Updated 2024-04-20 15:52:03 +02:00

103
4

f02ea667c1 · ggml : temporary disable llamafile sgemm until fixed · Updated 2024-04-16 21:45:56 +02:00

112
1

eedd42e376 · KV Cache defrag hash overflow - TMP Fix by @slaren · Updated 2024-04-16 10:24:34 +02:00

115
1

80d6c8152c · ggml : hash table improvements · Updated 2024-04-15 17:58:54 +02:00

119
1

8b495540fa · imatrix : remove invalid assert · Updated 2024-04-12 10:45:12 +02:00

140
1

072e0a4d3b · scipts : add LICENSE and gen-authors.sh to sync · Updated 2024-04-09 08:19:33 +02:00

216
3