CABank German CallFriend Corpus


Linguistic Data Consortium
University of Pennsylvania

Participants: 120
Type of Study: naturalistic
Location: Canada / USA
Media type: audio
DOI: doi:10.21415/T5CT2H

Media folder

Citation information

Some citation here.

In accordance with TalkBank rules, any use of data from this corpus must be accompanied by at least one of the above references.

Project Description

The CallFriend German corpus of telephone speech was collected by the Linguistic Data Consortium primarily in support of the project on Language Identification (LID), sponsored by the U.S. Department of Defense.

This release of the CallFriend German corpus consists of 60 unscripted telephone conversations between native speakers of German. The recorded conversations last up to 30 minutes. All speakers were aware that they were being recorded. They were given no guidelines concerning what they should talk about. Once a caller was recruited to participate, he/she was given a free choice of whom to call. Most participants called family members or close friends. All calls originated in either the United States or Canada.

Speakers were solicited by the LDC to participate in this telephone speech collection effort via the internet, publications (advertisements), and personal contacts. A total of 100 call originators were found, each of whom placed a telephone call via a toll-free robot operator maintained by the LDC. Access to the robot operator was possible via a unique Personal Identification Number (PIN) issued by the recruiting staff at the LDC when the caller enrolled in the project. The participants were made aware that their telephone call would be recorded, as were the call recipients. The call was allowed only if both parties agreed to being recorded. Each caller was allowed to talk up to 30 minutes. Upon successful completion of the call, the caller was paid $20 (in addition to making a free long-distance telephone call). Each caller was allowed to place only one telephone call. After a successful call was completed, a human audit of each telephone call was conducted to verify that the proper language was spoken, and to check the quality of the recording.
FileSexAgeEdPlacephone
ge_1082
ge_1672
ge_2100
ge_4008F3220Sindelfingen310453fgr
ge_4155M2618Kassel914945ocs
ge_4312F3214Mainz502942ojq
ge_4411F2313Germany914471umm
ge_4494M2418Bonn01144117942fam
ge_4497M4917Kassel305563xir
ge_4499F3417Muenster417271cfu
ge_4525M2618Munich510849ghg
ge_4572F8116Essen407638fgq
ge_4592F3713Baden Baden202745sea
ge_4636F5116Duesseldorf602648xij
ge_4989M3420Augsburg701839kaf
ge_4999M2418Bad Alzunge503344lfb
ge_5021F2216Wuppertal301982nki
ge_5042M2418Cologne714496shj
ge_5178M1611Cologne814862rby
ge_5226F2217Berlin301982nki
ge_5843F2418Cologne407657all
ge_6056M2517Berlin919968vjx
ge_6060M5710Duesseldorf813867khr
ge_6136F1913Neumuenster207866olm
ge_6360F6012Palestine519842fop
ge_6369F4218Hessen213658yoo
ge_6390F7018Augsburg610544mai
ge_6394M5416Braunschweig613836ocn
ge_6397F5912Hamburg218525gcj
ge_6411F6717Bad_Wimpfen310370udl
ge_6449M3710Allersberg215698hag
ge_6464F6115Braunschweig407452som
ge_6473F3424Cologne810661tgu
ge_6481F6918Saaz904785ujo
ge_6522F3816Gottingen313761dga
ge_6537F2516Nuernberg216630qdl
ge_6538M7718Cologne602584emo
ge_6548F6817Berlin517349tje
ge_6573F6614Berlin703893yer
ge_6577F1912Constance207783wor
ge_6590F3711Austria703960rgy
ge_6595M1712Munich860763paw
ge_6610F6914Nuernberg512244tol
ge_6644M2415Markdorf909592ylo
ge_6654M4917Heidenheim313994rit
ge_6669M6016Berlin708647jbp
ge_6683M2619Bierbach908665vgm
ge_6684M2618Burgsal318386mik
ge_6695M3123Munich512302ibw
ge_6718M7016Berlin313878rhu
ge_6780M2414Stuttgart305667opg
ge_6806M3019Oldenburg215727lfi
ge_6831M2415Stuttgart301422oee
ge_6832M2516Schwenningen703276jpt
ge_6846M2721Tuebingen816627gje
ge_6853M2417Stuttgart301422oee
ge_6918F6016Berlin312275vln
ge_6940M2315Stuttgart847432iai
ge_6972M3017Flensburg215387rjq
ge_7038M2616Hoefen860745ppp