Skip to content

2025 02 27 cloud provider benchmarks

Cloud provider benchmarks

Alternate Cloud Providers for GPU Access

Cloud Provider GPU Model Price per Month Price per Hour Setup Cost URL
Hetzner GEX 44 $205 N/A $88 Hetzner GEX 44
Digital Ocean H100 N/A $3.4 N/A Digital Ocean Pricing
Vast.ai Various N/A $0.5 N/A Vast.ai Pricing
Tensor Dock RTX 4090 N/A $0.5 N/A Tensor Dock Deploy
Hyperstack Cloud RTX A6000 N/A $0.5 N/A Hyperstack Cloud GPU Pricing
Run Pod RTX 4090 N/A $0.5 N/A Run Pod Pricing
  • Setup done on Huggingface Spaces

    • ASR - Automatic Speech Recognition

How to Use the Service

  1. With curl

You can test the service using curl commands. Below are examples for both service modes:

High Latency Service - CPU server

```sh curl_high_latency.sh curl -X 'POST' \ 'https://gaganyatri-asr-indic-server-cpu.hf.space/transcribe/?language=kannada' \ -H 'accept: application/json' \ -H 'Content-Type: multipart/form-data' \ -F 'file=@samples/kannada_sample_2.wav;type=audio/x-wav'

#### Low Latency Service - GPU service on Demand

```sh curl_low_latency.sh
curl -X 'POST' \
  'https://gaganyatri-asr-indic-server.hf.space/transcribe/?language=kannada' \
  -H 'accept: application/json' \
  -H 'Content-Type: multipart/form-data' \
  -F 'file=@samples/kannada_sample_2.wav;type=audio/x-wav'

  1. Via Swagger UI

  2. URL: High Latency ASR Service

  3. URL: Low Latency ASR Service

  4. Text to Speech

We have hosted a Text to Speech (TTS) service that can be used to verify the accuracy of Speech generation. The service is available in two modes:

Usage

How to Use the Service

You can test the service using curl commands. Below are examples for both service modes:

High Latency Service

```bash kannada_example.sh curl -X 'POST' \ 'https://gaganyatri-tts-indic-server.hf.space/v1/audio/speech' \ -H 'accept: application/json' \ -H 'Content-Type: application/json' \ -d '{"input": "ಉದ್ಯಾನದಲ್ಲಿ ಮಕ್ಕಳ ಆಟವಾಡುತ್ತಿದ್ದಾರೆ ಮತ್ತು ಪಕ್ಷಿಗಳು ಚಿಲಿಪಿಲಿ ಮಾಡುತ್ತಿವೆ.", "voice": "A female speaker delivers a slightly expressive and animated speech with a moderate speed and pitch. The recording is of very high quality, with the speakers voice sounding clear and very close up."}' -o audio_kannada_gpu_cloud.mp3

#### Low Latency Service

```bash kannada_example.sh
curl -X 'POST' \
  'https://gaganyatri-tts-indic-server-cpu.hf.space/v1/audio/speech' \
  -H 'accept: application/json' \
  -H 'Content-Type: application/json' \
  -d '{"input": "ಉದ್ಯಾನದಲ್ಲಿ ಮಕ್ಕಳ ಆಟವಾಡುತ್ತಿದ್ದಾರೆ ಮತ್ತು ಪಕ್ಷಿಗಳು ಚಿಲಿಪಿಲಿ ಮಾಡುತ್ತಿವೆ.", "voice": "A female speaker delivers a slightly expressive and animated speech with a moderate speed and pitch. The recording is of very high quality, with the speakers voice sounding clear and very close up."}'  -o audio_kannada_cpu_cloud.mp3

  1. Translation on Huggingface Spaces

  2. Test TTS + ASR for easy verification.

Test it on terminal/command line / postman / Insomnia

  1. Use the Text to Speech (TTS) to generate the speech by providing Kannada text. It creates a .wav file with name "audio_kannada_gpu_cloud.wav"

    curl -X 'POST' \
      'https://gaganyatri-tts-indic-server.hf.space/v1/audio/speech' \
      -H 'accept: application/json' \
      -H 'Content-Type: application/json' \
      -d '{"input": "ಉದ್ಯಾನದಲ್ಲಿ ಮಕ್ಕಳ ಆಟವಾಡುತ್ತಿದ್ದಾರೆ ಮತ್ತು ಪಕ್ಷಿಗಳು ಚಿಲಿಪಿಲಿ ಮಾಡುತ್ತಿವೆ.", "voice": "A female speaker delivers a slightly expressive and animated speech with a moderate speed and pitch. The recording is of very high quality, with the speakers voice sounding clear and very close up.",, "response_type": "wav"}'  -o audio_kannada_gpu_cloud.wav
    

  2. Now call the ASR - Automatic Speech Recognition by passing the Generated Speech

    curl -X 'POST' \
      'https://gaganyatri-asr-indic-server-cpu.hf.space/transcribe/?language=kannada' \
      -H 'accept: application/json' \
      -H 'Content-Type: multipart/form-data' \
      -F 'file=@audio_kannada_gpu_cloud.wav;type=audio/x-wav'
    

  3. Ola Krutrim CLoud

  4. ASR setup on Krutrim Cloud

Watch a quick demo of our project in action! Click the image below to view the video on YouTube.

Watch the video

We have hosted an Automatic Speech Recognition (ASR) service that can be used to verify the accuracy of audio transcriptions. The service is available in two modes:

We tested A100-NVLINK-Mini for the project. This setup is straightforward to initiate and deploy for basic inference tasks.

OlaKrutrim is an ideal provider for this task. They charge on an hourly basis with a calculated 15-minute interval, ensuring flexibility and cost-effectiveness. There is no long-term commitment required, making it easy to start and stop as needed.