Spanish Full-Duplex Conversation Dataset
MarketplaceTwo-speaker Spanish conversations spanning Latin American and European dialects, captured in stereo full-duplex with natural overlap.
Overview
Naturalistic, two-speaker Spanish conversations captured at studio quality in full-duplex stereo. Pairs of native Spanish speakers from Spain, Mexico, Colombia, Argentina, and other Latin American regions discuss everyday topics for the full duration of the session — no read scripts, no scene cuts. Each recording preserves real overlapping speech, backchannels, hesitations, and code-switching, so downstream models train on the way Spanish actually sounds in the wild. Every clip is collected from paid contributors with explicit consent, scene-level provenance, and metadata for speaker demographics, dialect, and acoustic environment.
Key highlights
- 01
Castilian Spain, Mexican, Colombian, and Argentinian Spanish pairings with explicit dialect tags per speaker.
- 02
Voseo, tuteo, and ustedeo register switches captured naturally per regional norm — not flattened into a single neutral form.
- 03
Spanglish code-switching from US-based and border-region bilingual speakers preserved at the utterance level.
- 04
Idiomatic phrases (modismos), regional slang (jerga), and rapid-fire interjections tagged for ASR robustness.
Technical specifications
Coverage
Hundreds of paired sessions from native Spanish speakers across Latin America and Spain — coverage extends to bespoke dialects, age groups, and topical targets on request.
Capture specs
Stereo full-duplex audio at 48 kHz / 24-bit per channel from studio-grade microphones, with per-speaker channel isolation, calibrated noise floor, and continuous capture for the full lifespan of each session — not cherry-picked moments.
Annotations
Speaker / expert metadata shipped with every session: age, gender, region, dialect, native language, and acoustic environment. Annotations available at request.
Use cases
- Full-duplex conversational AI training and evaluation
- Speaker diarization and Spanish ASR / TTS modelling
- Turn-taking, backchannel, and overlap-handling research
- Voice agent benchmarks for natural, multi-party conversation
Request samples
Share your use case and we'll send sample clips, pricing, and recommended next steps for your pipeline.
More datasets
Full-Duplex Conversational Audio
American English Full-Duplex Two-Speaker Conversational Dataset
Two-speaker American English conversations captured in full-duplex stereo, covering everyday topics with overlapping speech, backchannels, and natural disfluencies preserved.
Full-Duplex Conversational Audio
French Full-Duplex Conversation Dataset
Naturalistic French conversations between native speakers, captured in full-duplex stereo with overlapping speech and authentic turn-taking.
Full-Duplex Conversational Audio
Mandarin Full-Duplex Conversation Dataset
Native-speaker Mandarin Chinese conversations recorded in full-duplex stereo across mainland and overseas dialect regions.
Full-Duplex Conversational Audio
Vietnamese Full-Duplex Conversation Dataset
Native Vietnamese conversations captured in full-duplex stereo, with North-Central-South dialect coverage and natural turn-taking.