This portal contains sharable data from NSF-funded project # IBSS-1519908, "Communication in the Global University: A Longitudinal Study of Language Adaptation at Multiple Timescales in Native- and Non-Native Speakers." These data were collected longitudinally using multiple methods and tasks, from over 100 non-native speakers of English. The speakers were ITAs (international teaching assistants) enrolled in STEM Ph.D. programs at Stony Brook University on Long Island, NY. Each participated for two years in one of three waves of data collection between 2015 and 2019, beginning shortly after arriving in the U.S. from their home country. Each completed repeated assessments and provided recordings at regular intervals over four semesters, resulting in a variety of types of data. Because Mandarin is the most common native language spoken by ITAs at Stony Brook, five sets of "baseline" data were collected from the Mandarin speakers (upon arrival, and at the end of each of the next four semesters). For native speakers of other languages, three sets of data were collected (upon arrival, and at the end of the first and second academic years). Each data collection session lasted approximately 2 hours. Survey data were collected also, but are not released here due to privacy concerns. Some of the ITAs also participated in ethnographic recordings during which they were observed teaching students in lab classes; transcripts of those sessions but not videos are available due to privacy concerns.
See <https://www.stonybrook.edu/commcms/ita-grant/index.php> for more information and additional project materials.
This portal includes data from the following:
|
Data Type or Task |
Data Format |
1. |
Speakers |
Spreadsheet (.xlsx) |
2. |
Reading from a word list |
sound files (.wav) |
3. |
Focus dialogues |
sound files (.wav) |
4. |
Spontaneous speech |
sound files (.wav) |
5. |
Ethnographic transcripts |
|
This material is based upon work supported by NSF under Grant # IBSS-1519908.