Actions: ggerganov/llama.cpp

Showing runs from all workflows
76,398 workflow runs

ggml: implement quantized KV cache for FA
EditorConfig Checker #12793: Pull request #7372 opened by JohannesGaessler
May 18, 2024 20:34 4m 5s JohannesGaessler:fa-quantize-3
ggml: implement quantized KV cache for FA
Publish Docker image #12335: Pull request #7372 opened by JohannesGaessler
May 18, 2024 20:34 In progress JohannesGaessler:fa-quantize-3
ggml: implement quantized KV cache for FA
Benchmark #1508: Pull request #7372 opened by JohannesGaessler
May 18, 2024 20:34 Queued
ggml: implement quantized KV cache for FA
Pull Request Labeler #22: Pull request #7372 opened by JohannesGaessler
May 18, 2024 20:34 13s
ggml: implement quantized KV cache for FA
Server #3215: Pull request #7372 opened by JohannesGaessler
May 18, 2024 20:34 In progress
grammars: early exit when no next_candidates to reject
Pull Request Labeler #21: Pull request #7370 opened by ochafik
May 18, 2024 17:46 12s
grammars: early exit when no next_candidates to reject
Server #3214: Pull request #7370 opened by ochafik
May 18, 2024 17:46 9m 29s
grammars: early exit when no next_candidates to reject
Benchmark #1507: Pull request #7370 opened by ochafik
May 18, 2024 17:46 Queued