last updated: 13.03.09
I wish to THANK all the volunteers and transcribers for their cooperation and immense input, without which the project would not have started at all!
PLINDSEI is a Polish project launched in 2005 as a branch of international LINDSEI project.
Broadly speaking, it is a corpus of spoken learner English of Polish native speakers.
PLINDSEI in stages:
STAGE I:
- In its early stages, it consisted in collecting 50 interviews, from among volunteers at the School of English (IFA), Adam Mickiewicz University. They were all 3rd or 4th English Philology students at that time.
- The interviews were taken along the lines of LINDSEI guidelines.
- The interviews were recorded by means of a digital recorder.
- Each interview lasted for about 15 minutes.
- The interviewees were asked to sign a form where they agreed for the data to be used for scientific purposes.
- Later on, the interviews were transcribed, using SoundScriber and OpenOffice software. The transcription process took 7 months of everyday work and was finalized thanks to a group of seminar students of prof. Dziubalska-Kołaczyk.
- It needs to be stressed that all spontaneous speech phenomena, such as repetitions, false starts, hesitations, etc. are reflected in the transcription.
- Interviewer (A) & interviewee (B) turns are marked as to enable research on B-turns only.
STAGE III:
- final transcription check
- division into 3 parts along the lines of LINDSEI interview guidelines
- digitalizing database of interviewees' social variables
LINDSEI corpus is going to be published on a CD-rom (together with a program for its use)
in 2009!
STAGE IV:
- PLINDSEI annotation - in the process of POS tagging with CLAWS7 (March 2009)
interested? mail wjoanna AT ifa.amu.edu.pl
ZAJeK presentation at IFA 25.04.08 handout here