Freelance Opportunity Transcription Specialist Remote | Uber AI Solutions | Handshake

Transcription Specialist

Uber AI Solutions is seeking detail-oriented transcription specialists to support a large-scale generative AI training program. In this engagement, you will transcribe and annotate single-track and multitrack audio files accurately, capturing every utterance, stutter, and linguistic nuance exactly as spoken.

Supported Languages & Dialects

We are looking for freelancers in the following languages: 

  • Arabic: (ar-001 | ar-MSA), (ar-SA), (ar-AE | ar-UAE)
  • Bengali: (bn-BD | bn-IN)
  • Catalan: (ca-ES)
  • Chinese: (zh-CN | zh-Hans), (zh-Hant), (zh-HK), (zh-TW)
  • Croatian: (hr-HR)
  • Czech: (cs-CZ)
  • Danish: (da-DK)
  • Dutch: (nl-NL)
  • English: (en-US), (en-GB)
  • Estonian: (et-EE)
  • Finnish: (fi-FI)
  • French: (fr-FR), (fr-CA)
  • German: (de-DE), (de-CH)
  • Greek: (el-GR)
  • Hebrew: (he-IL)
  • Hindi: (hi-IN)
  • Hungarian: (hu-HU)
  • Indonesian: (id-ID)
  • Italian: (it-IT)
  • Japanese: (ja-JP)
  • Kannada: (kn-IN)
  • Korean: (ko-KR)
  • Lithuanian: (lt-LT)
  • Maithili: (mai-IN)
  • Malay: (ms-MY)
  • Malayalam: (ml-IN)
  • Norwegian: (no-NO)
  • Polish: (pl-PL)
  • Portuguese: (pt-PT), (pt-BR)
  • Romanian: (ro-RO)
  • Russian: (ru-RU)
  • Sinhala: (si-LK)
  • Slovak: (sk-SK)
  • Spanish: (es-ES), (es-US), (es-419 | es-LATAM), (es-MX)
  • Swedish: (sv-SE)
  • Tagalog/Filipino: (tl-PH)
  • Tamil: (ta-IN)
  • Telugu: (te-IN)
  • Thai: (th-TH)
  • Turkish: (tr-TR)
  • Ukrainian: (uk-UA)
  • Urdu: (ur-PK)
  • Vietnamese: (vi-VN)

What you’ll work on

  • Transcription: Transcribe audio with 98% accuracy, capturing every disfluency, filler word (um, uh), false start, and stutter exactly as heard.
  • Precision Timestamping: Align text segments to the audio waveform with millisecond precision (max gap <500ms).
  • Speaker Identification: Accurately identify and label speakers in multi-speaker audio files (2–8 interlocutors).
  • Tagging and Annotation: Apply correct tags for non-speech events—like (laughs) or (applause)—and unintelligible segments.

Skills and Qualifications

  • Native-level fluency: You must be a native speaker of the assigned language with a deep understanding of cultural nuances and regional accents.
  • Attention to Detail: You can distinguish between "clean" speech and "verbatim" speech (e.g., typing "I- I- I don't know" instead of "I don't know").
  • Tech Savvy: You are comfortable learning and navigating new web-based annotation tools.

Engagement Details

  • Location: Remote (Global)
  • Volume: Steady task flow available for high-quality contributors. (Note: Additional details about the project will be provided as they become available.)
  • Flexibility: Work on your own schedule, provided quality, consistency, and deadline standards are met.
  • Type: Freelance/Independent Contractor

Why this matters 

Your expertise will guide how AI systems handle complex logic and human-centered communication. By transcribing and refining audio, text, and responses, you’ll help ensure that AI is not only accurate but also clear, safe, and engaging for professional use.


 
