Rafa SanchezinGoogle Cloud - CommunityRemote to Vertex AI Workbench Instances over an IAP tunnel with VS CodeVS Code can be used as frontend with Vertex AI Workbench Instances as remote, using SSH and IAP without exposing external IP addressesOct 26, 2023Oct 26, 2023
Rafa SanchezinGoogle Cloud - CommunityGenerative AI — Deploy and inference of Llama 2 in Vertex AI PredictionThis post shows how to deploy a Llama 2 chat model (7B parameters) in Vertex AI Prediction with a T4 GPU.Aug 27, 2023Aug 27, 2023
Rafa SanchezinGoogle Cloud - CommunityGenerative AI — Q&A with semantic answering on large scanned documents with Vertex AI, Chroma…The repo shows how to make semantic question and answering over large scanned documents and pdfs, like scanned mortgages and others.Jun 21, 2023Jun 21, 2023
Rafa SanchezinGoogle Cloud - CommunityGenerative AI — PaLM-2 model deployment with Cloud RunLearn how to deploy a simple Gradio app in Cloud Run that calls a PaLM-2 modelMay 26, 2023May 26, 2023
Rafa SanchezinGoogle Cloud - CommunityFine-tuning FLAN-T5 XXL with DeepSpeed and Vertex AILearn how to fine-tune a FLAN-T5 XXL model in Vertex AI, using the DeepSpeed library with 8xA100 GPUs.Apr 11, 2023Apr 11, 2023
Rafa SanchezinGoogle Cloud - CommunityDeploy Flan-T5 XXL on Vertex AI PredictionLearn how to deploy a FLAN-T5 XXL model in Vertex AI. The model will be downloaded and embedded in the custom prediction image, and…Mar 9, 2023Mar 9, 2023
Rafa SanchezinGoogle Cloud - CommunityFinetuning Flan-T5-Base and online deployment in Vertex AILearn how to finetune and deploy a Flan-T5-Base model using the SAMSum dataset (summary of conversations in English) in Vertex AIFeb 28, 20231Feb 28, 20231