Create an Endpoint
After your first login, you will be directed to the Endpoint creation page. As an example, this guide will go through the steps to deploy distilbert-base-uncased-finetuned-sst-2-english for text classification.
1. Enter the Model Database Repository ID and your desired endpoint name:
data:image/s3,"s3://crabby-images/ad139/ad139678736bec62e6110044a9cd9a2f4c5c7776" alt="select repository"
2. Select your Cloud Provider and region. Initially, only AWS will be available as a Cloud Provider with the us-east-1
and eu-west-1
regions. We will add Azure soon, and if you need to test Endpoints with other Cloud Providers or regions, please let us know.
data:image/s3,"s3://crabby-images/a0e93/a0e9327c50269c1b1cd6efa20b8ec5fb00824020" alt="select region"
3. Define the [Security Level](security) for the Endpoint:
data:image/s3,"s3://crabby-images/eb159/eb159a6935f4eb8d7828ea84fddec04572d2c765" alt="define security"
4. Create your Endpoint by clicking **Create Endpoint**. By default, your Endpoint is created with a medium CPU (2 x 4GB vCPUs with Intel Xeon Ice Lake) The cost estimate assumes the Endpoint will be up for an entire month, and does not take autoscaling into account.
data:image/s3,"s3://crabby-images/24434/244340747d0ad8aeeea78735bbd16d48ff088e56" alt="create endpoint"
5. Wait for the Endpoint to build, initialize and run which can take between 1 to 5 minutes.
data:image/s3,"s3://crabby-images/8d095/8d0952c2e8a84625e874f168b00eb9d1ca2a7121" alt="overview"
6. Test your Endpoint in the overview with the Inference widget π π!
data:image/s3,"s3://crabby-images/222fb/222fbf18be5e1eae13f81565bed20d6c745643d6" alt="run inference"