Inference Pricing

LLM, Image, Audio, Video, and Signal models are available through Inference API. For these models, Pay-As-You-Go or use a Flat rate of $1000 per month on your hosting. Prices are per 1,000 tokens including input and output tokens.

Model

Model Size	Large Language Model, Chat	Image/ Audio/ Video/ Signal	Price per Hour Hosting
40.1B - 70B	$0.0010	$0.01	$6.17

Get in touch so we can start working together.