top of page

Inference Pricing


LLM, Image, Audio, Video, and Signal models are available through Inference API. For these models, Pay-As-You-Go or use a Flat rate of $1000 per month on your hosting. Prices are per 1,000 tokens including input and output tokens.

Model

Price per 1 token

Model Size
Large Language Model, Chat
Image/ Audio/ Video/ Signal
Price per Hour Hosting
40.1B - 70B
$0.0010
$0.01
$6.17

Fine-tuned models

After you fine-tune a model with the Fine-tuning API you can host it for inference. When hosting your own model you pay hourly for the GPU instances. You can start or stop your instance any time through the web-based Playground or using the start/stop instance APIs.

Let’s Get Started

Get in touch so we can start working together.

  • Youtube
  • X
  • Facebook
  • LinkedIn

Thanks for submitting!

Want to LEARN more?

Schedule a consultation to discuss your specific organization's needs and find out how Gennet.ai can deliver value to your organization.

USA Offices:

Greater Chicago Area: 581 Sullivan Rd Aurora, IL 60506, USA

 

DMV: 5680 King Centre Drive, Suite 600, Alexandria, VA 22315, USA

​

Canada Office: 

18 King Street East, Suite 1400, 
Toronto, ON, M5C 1C4 Canada

​

Contact Us:

Phone: +1-213-786-4783

Email: care@gennet.ai

HIPPA Compliance Certificate

image (2).png
HIPAA, Epic, AWS and Microsoft Azure are the plathforms Gennet.AI uses for compliances and cloud computing
cdn.png

Follow Us On:

  • LinkedIn
  • X
  • Facebook
  • Youtube
  • Whatsapp

Copyright © 2024
Gennet.AI, Unique Computing, LLC

 

​

All rights reserved. Terms and conditions, features, support, pricing, and service options are subject to change without notice.

bottom of page