Phoneme-Level Speech API

Speech-to-IPA & Pronunciation Assessment API

Langcraft provides a developer-first API that converts spoken audio into IPA phonemes with millisecond timestamps, pronunciation confidence, and mispronunciation detection. It is purpose-built for high-volume language learning apps, oral assessments, and real-time speech feedback experiences.

Key API Features

  • Accurate Transcription: Speech-to-IPA with per-phoneme start and end timestamps.
  • Scoring: Confidence scores and articulatory similarity metrics for automated pronunciation assessment.
  • Alignment: Forced alignment of audio to reference text to detect mispronunciations and fluency gaps.
  • Format Support: Native support for WebM, WAV, MP3, and M4A uploads.
  • Integration: Simple REST endpoint: POST https://api.langcraft.world/transcribe

Get started

Review the API documentation, try the live demo, or request an API key to integrate phoneme-level speech analysis into your product.

Read the API docs Test the live demo Request an API key