Emit live metrics #6

Open
madhurprash opened this issue Jan 26, 2024 · 2 comments
Labels
help wanted Extra attention is needed

Comments

@madhurprash
Collaborator

Emit live metrics so that they can be monitored in Grafana through a live dashboard. More information on this issue is to come, but the goal is to give users full flexibility to view metrics in the way that best suits their business and technical goals.

[TBD] --> An analytics pipeline that emits live results for different model configurations, with the metrics chosen according to the needs of the user.

@aarora79 added the help wanted label Feb 15, 2024
@aarora79
Contributor

@madhurprash please get details from Shamik and add here in this ticket.

@shamikatamazon
Collaborator

The intent of this functionality is to use Prometheus to store metrics and Grafana to visualize the performance of an LLM workload. To achieve this, the client, and possibly the serving engine, should emit metrics that are scraped by a Prometheus agent and persisted in Prometheus. Grafana then uses the data from Prometheus to display metrics in a dashboard provided by this project. The steps to deploy a model and the pipeline on an EC2 instance are outlined at https://github.com/shamikatamazon/genai/tree/main/benchmarking


3 participants