Skip to content

Pull requests: triton-inference-server/server

Author
Filter by author
Label
Filter by label
Use alt + click/return to exclude labels
or + click/return for logical OR
Milestones
Filter by milestone
Reviews
Assignee
Filter by who’s assigned
Sort

Pull requests list

Added new flag for GPU peer access API control
#7261 opened May 23, 2024 by indrajit96 Loading…
Update expected error message
#7258 opened May 22, 2024 by kthui Loading…
Add response sender test for new behavior
#7257 opened May 22, 2024 by kthui Loading…
Add response sender test base
#7254 opened May 21, 2024 by kthui Loading…
Add testing for decoupled model use case
#7246 opened May 20, 2024 by krishung5 Loading…
Ci updates
#7241 opened May 17, 2024 by nvda-mesharma Loading…
Jetpack 5.1.2 Patch 2
#7217 opened May 14, 2024 by fpetrini15 Draft
Bump vllm to v0.4.2 module: backends Issues related to the backends
#7198 opened May 9, 2024 by kebe7jun Loading…
add test for shape validation
#7195 opened May 8, 2024 by jbkyang-nvi Loading…
Add testing for L0_iterative_tutorial
#7176 opened May 1, 2024 by Tabrizian Loading…
Change TensorRT-LLM
#7143 opened Apr 20, 2024 by mc-nv Draft
Update L0_trace for DLIS-6462
#7134 opened Apr 18, 2024 by indrajit96 Loading…
update openvino to 2024.0.0
#7089 opened Apr 9, 2024 by dtrawins Loading…
Have ability to split the build in CI
#7076 opened Apr 5, 2024 by mc-nv Loading…
added support to intel gpu in a custom build
#7047 opened Mar 27, 2024 by dtrawins Loading…
Update quickstart.md
#7033 opened Mar 26, 2024 by inf3rnus Loading…
debugging guide: fix verbose logging option
#6927 opened Feb 29, 2024 by catwell Loading…
Update pytorch version for tests
#6907 opened Feb 23, 2024 by jbkyang-nvi Loading…
ProTip! no:milestone will show everything without a milestone.