CosmicAC Logo

Create a Managed Inference Job (Parakeet)

Create a speech-to-text Parakeet Managed Inference Job with the CLI.

Create a Parakeet Managed Inference Job from the CLI. Answer the prompts, and CosmicAC deploys the speech-to-text model behind an OpenAI-compatible transcription endpoint.

Prerequisites

You need the following before you start:

Steps

Create the job

Start the interactive job setup:

cosmicac jobs create

Select Managed Inference (Parakeet) as the job type, then set these fields:

  • Job name — a name to identify the job.
  • Tags — comma-separated labels for the job.
  • Location — the region where the job runs.
  • GPU type — the GPU to use. The CLI lists the GPUs available in your location.
  • GPU count — the number of GPUs.
  • Model — the Parakeet model to serve, nvidia/parakeet-tdt-0.6b-v3.
  • Chunk duration — the audio chunk length in seconds.
  • Chunk overlap — the overlap between chunks in seconds.
  • Max file size (MB) — the maximum audio upload size.
  • Endpoint name — a name for the endpoint, used in its URL path.
  • Require Authorization header — whether callers must send an API key. See Create an API key.

The Job configuration reference describes each field.

CosmicAC creates the job and prints its ID.

Confirm the deployment

Check that your endpoint is serving:

cosmicac models healthcheck

Your endpoint appears as Endpoint: <endpoint-name>.

Next steps

On this page