CABank Japanese CallHome Corpus

Participants:	120
Type of Study:	phone call
Location:	United States
Media type:	audio
DOI:	doi:10.21415/T5H59V

Citation information

Some citation here.

In accordance with TalkBank rules, any use of data from this corpus must be accompanied by at least one of the above references.

Project Description

This is the Japanese portion of CallHome.

Speakers were solicited by the LDC to participate in this telephone speech collection effort via the internet, publications (advertisements), and personal contacts. A total of 200 call originators were found, each of whom placed a telephone call via a toll-free robot operator maintained by the LDC. Access to the robot operator was possible via a unique Personal Identification Number (PIN) issued by the recruiting staff at the LDC when the caller enrolled in the project. The participants were made aware that their telephone call would be recorded, as were the call recipients. The call was allowed only if both parties agreed to being recorded. Each caller was allowed to talk up to 30 minutes. Upon successful completion of the call, the caller was paid $20 (in addition to making a free long-distance telephone call). Each caller was allowed to place only one telephone call.

Although the goal of the call collection effort was to have unique speakers in all calls, a handful of repeat speakers are included in the corpus. In all, 200 calls were transcribed. Of these, 80 have been designated as training calls, 20 as development test calls, and 100 as evaluation test calls. For each of the training and development test calls, a contiguous 10-minute region was selected for transcription; for the evaluation test calls, a 5-minute region was transcribed. For the present publication, only 20 of the evaluation test calls are being released; the remaining 80 test calls are being held in reserve for future LVCSR benchmark tests.

After a successful call was completed, a human audit of each telephone call was conducted to verify that the proper language was spoken, to check the quality of the recording, and to select and describe the region to be transcribed. The description of the transcribed region provides information about channel quality, number of speakers, their gender, and other attributes.

File Sex Age Age Place
ja_0856
ja_0924 38 16
ja_0930
ja_1012 31 16
ja_1032
ja_1041
ja_1048 41
ja_1057 41 18
ja_1099
ja_1109
ja_1123
ja_1201
ja_1237 37 21
ja_1263 37 21
ja_1277 33
ja_1288 43 20
ja_1290 16
ja_1328 29 14
ja_1369 25 12
ja_1370 26 12
ja_1418
ja_1425 16
ja_1428 30 16
ja_1461 34 14
ja_1509 30 16
ja_1538 33 14
ja_1541
ja_1542 35 18
ja_1557 16 10
ja_1593 26 14
ja_1604 28 19
ja_1607
ja_1608 45 12
ja_1615 13
ja_1628 40 14
ja_1642 12
ja_1667 44 16
ja_1710 31 16
ja_1713 24 17
ja_1725 12
ja_1731 63 12
ja_1738 30 16
ja_1741 27 15
ja_1749 18
ja_1889 16
ja_1899 22 12
ja_1925 19 14
ja_1928 13
ja_1999 31 16
ja_2004 58 12
ja_2041
ja_2085 40 17
ja_2096 36 17
ja_2111 19 12
ja_2134 18 13
ja_2157 13
ja_2180 36 16
ja_2188 18
ja_2199
ja_2204 25 12
ja_2206 22 15
ja_2207 31 12
ja_2208 33 16
ja_2209 82 8
ja_2210 28 14
ja_2212 61 10
ja_2215 45 12
ja_2217 65 15
ja_2218 47 16
ja_2219 28 18
ja_2220 43 22
ja_2222
ja_2224 29 20
ja_2225 54 12
ja_2231 31 20
ja_2234 26 16
ja_2235 50 12
ja_2237 29 16
ja_2239 29 14
ja_2243 18 12
ja_0743
ja_0922
ja_0988
ja_1003
ja_1069
ja_1622
ja_1629 54 14
ja_1670 28 21
ja_1688 21 12
ja_1690 19 11
ja_1967 16
ja_2035 30 14
ja_2214
ja_2238 29 23
ja_3002
ja_3004
ja_3005
ja_3008
ja_4061
ja_4275
ja_0696
ja_0862
ja_0986 32 16
ja_1005
ja_1072 34 16
ja_1586 35 18
ja_1674 54 16
ja_1832 19 13
ja_1867 30 16
ja_1966 27 16
ja_2053 14
ja_2074 46 16
ja_2196 28 19
ja_2216 25 18
ja_2223 36 16
ja_2236 13
ja_2242 48 14
ja_3001
ja_3006
ja_3007

Acknowledgements

Andrew Yankes reformatted this corpus into accord with current versions of CHAT.