Deploy Gemma 2 LLM with Text Generation Inference (TGI) on Google Cloud GPU
- 2024.11.24
- Google Cloud Platform

My Medium article with the code and a detailed guide:
https://medium.com/@agapie/deploy-gemma-2-llm-with-text-generation-inference-tgi-on-google-cloud-gpu-86093af9e9e2
TGI_DOCKER_URI = "us-docker.pkg.dev/deeplearning-platform-release/gcr.io/huggingface-text-generation-inference-cu124.2-3.ubuntu2204.py311"
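Once this container is serving a model (locally via Docker or behind a Vertex AI endpoint), TGI exposes a `/generate` REST route that can be called with only the Python standard library. A minimal sketch — the endpoint URL and the sampling parameters here are assumptions; adjust them to your deployment:

```python
# Hedged sketch of a client for TGI's /generate route (stdlib only).
# The endpoint URL and parameter values are illustrative assumptions.
import json
import urllib.request


def build_generate_request(prompt: str, max_new_tokens: int = 64,
                           temperature: float = 0.7) -> bytes:
    """Encode a TGI /generate request body as JSON bytes."""
    payload = {
        "inputs": prompt,
        "parameters": {
            "max_new_tokens": max_new_tokens,
            "temperature": temperature,
        },
    }
    return json.dumps(payload).encode("utf-8")


def generate(endpoint: str, prompt: str) -> str:
    """POST the prompt to a running TGI server and return the generated text."""
    req = urllib.request.Request(
        endpoint.rstrip("/") + "/generate",
        data=build_generate_request(prompt),
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(req) as resp:
        # TGI's /generate route returns {"generated_text": "..."}.
        return json.loads(resp.read())["generated_text"]
```

Usage, assuming the container is listening on port 8080: `generate("http://localhost:8080", "Explain GPUs in one sentence.")`.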
Gemma 2:
https://ai.google.dev/gemma
https://huggingface.co/google/gemma-2-2b-it
https://arxiv.org/abs/2408.00118
Text Generation Inference (TGI):
https://huggingface.co/docs/text-generation-inference/en/index
https://huggingface.co/blog/martinigoyanes/llm-inference-at-scale-with-tgi