Skip to main content

AWS Sagemaker

LiteLLM supports Llama2 on Sagemaker

API KEYS

!pip install boto3 

os.environ["AWS_ACCESS_KEY_ID"] = ""
os.environ["AWS_SECRET_ACCESS_KEY"] = ""
os.environ["AWS_REGION_NAME"] = ""

AWS Sagemaker Models

Here's an example of using a sagemaker model with LiteLLM

Model NameFunction CallRequired OS Variables
Llama2 7Bcompletion(model='sagemaker/jumpstart-dft-meta-textgeneration-llama-2-7b, messages=messages)os.environ['AWS_ACCESS_KEY_ID'], os.environ['AWS_SECRET_ACCESS_KEY'], os.environ['AWS_REGION_NAME']
Custom LLM Endpointcompletion(model='sagemaker/your-endpoint, messages=messages)os.environ['AWS_ACCESS_KEY_ID'], os.environ['AWS_SECRET_ACCESS_KEY'], os.environ['AWS_REGION_NAME']