A high-throughput and memory-efficient inference and serving engine for LLMs
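For context, a minimal sketch of offline generation with vLLM's Python API; the model name and prompt are illustrative, and the snippet assumes the `vllm` package is installed:

```python
# Minimal offline-inference sketch with vLLM (model name and prompt are illustrative).
from vllm import LLM, SamplingParams

llm = LLM(model="facebook/opt-125m")                 # loads weights and allocates the KV-cache pool
params = SamplingParams(temperature=0.8, max_tokens=64)

outputs = llm.generate(["The capital of France is"], params)
for out in outputs:
    print(out.outputs[0].text)                       # first sampled completion for each prompt
```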
SkyPilot: Run LLMs, AI, and Batch jobs on any cloud. Get maximum savings, highest GPU availability, and managed execution—all with a simple interface.
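As a sketch of what that "simple interface" can look like, here is an illustrative use of SkyPilot's Python API; the cluster name, accelerator type, and training command are assumptions, not taken from the project:

```python
# Illustrative SkyPilot launch (cluster name, accelerator, and command are hypothetical).
import sky

task = sky.Task(run="python train.py")
task.set_resources(sky.Resources(accelerators="A100:1"))  # SkyPilot picks a cloud/region with capacity

sky.launch(task, cluster_name="my-cluster")               # provisions a VM and runs the task on it
```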
PyTorch/XLA integration with JetStream (https://github.com/google/JetStream) for LLM inference
JetStream is a throughput- and memory-optimized engine for LLM inference on XLA devices, starting with TPUs (with GPUs planned in the future; PRs welcome).
Everything you want to know about Google Cloud TPU
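As a quick, hedged companion example for working with Cloud TPUs: the sketch below checks that a TPU runtime is visible from JAX. It assumes a Cloud TPU VM with the `jax[tpu]` extra installed:

```python
# Quick check that a TPU runtime is visible to JAX (assumes a Cloud TPU VM
# with `pip install "jax[tpu]"` already done).
import jax

devices = jax.devices()                          # e.g. [TpuDevice(id=0), ...] on a TPU host
print(f"{len(devices)} device(s):", devices)
print("default backend:", jax.default_backend()) # prints "tpu" when TPUs are attached
```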
TPU Pod Commander is a package for managing and launching jobs on Google Cloud TPU pods.
DECIMER: Deep Learning for Chemical Image Recognition using EfficientNetV2 + Transformer
Everything we actually know about the Apple Neural Engine (ANE)
Artificial Intelligence
cBLUE is a tool to calculate the total propagated uncertainty of bathymetric lidar data.
Testing framework for Deep Learning models (TensorFlow and PyTorch) on Google Cloud hardware accelerators (TPU and GPU)
Differentiable Fluid Dynamics Package
Benchmarking suite to evaluate 🤖 robotics computing performance. Vendor-neutral. ⚪ Grey-box and ⚫ Black-box approaches.
Boost hardware utilization for ML training workloads via Inter-model Horizontal Fusion
Google Coral TPU DKMS Driver package for Fedora, RHEL, OpenSUSE, and OpenMandriva
Solana TpuClient TypeScript Implementation