Issues: triton-inference-server/server
Support histogram custom metric in Python backend [enhancement: New feature or request]
#7287 opened May 28, 2024 by ShuaiShao93
What is the correct way to run inference in parallel in Triton?
#7283 opened May 28, 2024 by sandesha-hegde
Confusion about prefetch [performance: A possible performance tune-up] [question: Further information is requested]
#7282 opened May 28, 2024 by SunnyGhj
Windows 10 docker build error: "Could not locate a complete Visual Studio instance" [investigating: The development team is investigating this issue]
#7281 opened May 28, 2024 by jinkilee
Automatically unload (oldest) models when memory is full [enhancement: New feature or request]
#7279 opened May 27, 2024 by elmuz
No 24.05-trtllm-python-py3 in NGC repo [question: Further information is requested]
#7277 opened May 25, 2024 by avianion
[Bug] Model 'ensemble' receives inputs originating from different decoupled models
#7275 opened May 25, 2024 by michaelnny
Triton BLS model with dynamic batching does not execute at the expected batch size [investigating: The development team is investigating this issue]
#7271 opened May 24, 2024 by njaramish
Tritonserver hangs on launch with Python backend [investigating: The development team is investigating this issue]
#7268 opened May 24, 2024 by JamesBowerXanda
Custom backend using recommended.cc not generating correct output [investigating: The development team is investigating this issue]
#7266 opened May 24, 2024 by jgrsdave
Pods receiving traffic too early when scaling with HPA causes "Socket closed" errors on Triton Inference Server [investigating: The development team is investigating this issue]
#7264 opened May 23, 2024 by patriksabol
Add server-side metrics for input and output sizes
#7263 opened May 23, 2024 by yongbinfeng
launch_triton_server.py attempts to place two models on the same GPU instead of one model on two GPUs
#7255 opened May 21, 2024 by ethan-digi