Published inGoogle Cloud - CommunityA review of Arxiv publications on Gemma family modelsA collection of Arxiv papers describing main innovations of Gemma family modelsDec 9, 20241Dec 9, 20241
Published inGoogle Cloud - CommunityRemote to Vertex AI Workbench Instances over an IAP tunnel with VS CodeVS Code can be used as frontend with Vertex AI Workbench Instances as remote, using SSH and IAP without exposing external IP addressesOct 26, 2023Oct 26, 2023
Published inGoogle Cloud - CommunityGenerative AI — Deploy and inference of Llama 2 in Vertex AI PredictionThis post shows how to deploy a Llama 2 chat model (7B parameters) in Vertex AI Prediction with a T4 GPU.Aug 27, 2023Aug 27, 2023
Published inGoogle Cloud - CommunityGenerative AI — Q&A with semantic answering on large scanned documents with Vertex AI, Chroma…The repo shows how to make semantic question and answering over large scanned documents and pdfs, like scanned mortgages and others.Jun 21, 2023Jun 21, 2023
Published inGoogle Cloud - CommunityGenerative AI — PaLM-2 model deployment with Cloud RunLearn how to deploy a simple Gradio app in Cloud Run that calls a PaLM-2 modelMay 26, 2023May 26, 2023
Published inGoogle Cloud - CommunityFine-tuning FLAN-T5 XXL with DeepSpeed and Vertex AILearn how to fine-tune a FLAN-T5 XXL model in Vertex AI, using the DeepSpeed library with 8xA100 GPUs.Apr 11, 2023Apr 11, 2023
Published inGoogle Cloud - CommunityDeploy Flan-T5 XXL on Vertex AI PredictionLearn how to deploy a FLAN-T5 XXL model in Vertex AI. The model will be downloaded and embedded in the custom prediction image, and…Mar 9, 2023Mar 9, 2023
Published inGoogle Cloud - CommunityFinetuning Flan-T5-Base and online deployment in Vertex AILearn how to finetune and deploy a Flan-T5-Base model using the SAMSum dataset (summary of conversations in English) in Vertex AIFeb 28, 20231Feb 28, 20231