Sentence Transformers Model

Sentence Transformers Model

FastAPI server to serve sentence-transformer embeddings

Deploy Sentence Transformers Model

Sentence Transformers API

morgangallant/sentence-transformers-railway

Just deployed

Given any sentence transformer model, this template will deploy a quick and easy FastAPI server to serve embeddings for that model.

Setup requires you to choose which model to run, and choose the number of workers. Be careful using models that are too large, since a) they'll be expensive since the weights will be loaded into memory N times (i.e. depending on number of workers you chose during setup), and b) they'll be running on CPU so inference isn't overly quick compared to GPUs.

Usage is simple, just post {"texts":["hello","world"]} to /embed on the exposed server, and you'll get back the embeddings in JSON.


Template Content

Deploy Now

Details

Created on Sep 24, 2023

22 total projects

11 active projects

Python, Dockerfile

AI/ML



More templates in this category

View Template
Chat Chat

Chat Chat

Chat Chat, your own unified chat and search to AI platform.


Harry Yep

View Template
openui

openui

Deploy OpenUI: AI-powered UI generation with GitHub OAuth and OpenAI API.


zexd

View Template
firecrawl

firecrawl

firecrawl api server + worker without auth, works with dify


Neuron Capital