Quick start with Google Cloud Vertex AI

Availability

Business Plan for the Wren AI Self-Hosted Version.

Supported regions

Ensure that your selected region supports the following Google model we're using by default:

Refer to the official Google model endpoint locations: Google model endpoint locations.

If you encounter any issues, feel free to contact us to discuss alternative model selections for your region.

Prerequisites

Google Gemini (Vertex AI) onboarding

Enter Vertex location (region), Vertex project (your GCP project ID), then upload your service account JSON.

Google Gemini (Vertex AI) region and project

Google Gemini (Vertex AI) test embedding model connection

Update model configurations. There are default configs of the pre-selected models on the UI.

You'll need to update the configurations for optimization. Click "Configure" for each listed model and set:
```
{
  "kwargs": {
    "n": 1,
    "timeout": 60,
    "temperature": 0
  },
  "context_window_size": 1000000
}
```
Review the pipeline assignments.

We provide default pipelines optimized for Gemini models. You could leave it as is.
Complete the setup.

Scroll to the bottom and click "Complete setup".