layout: true
background-image: url(theme/WA_logo_xaringan_small.png), url(theme/ncn-logo-poziom-en.png), url(theme/ICPhS2019-transparent.png)
background-position: 12% 89%, 48% 85%, 89% 90%
background-size: 9%, 40%, 15%

---
class: clear, center, middle

.font200[Durational variation in Polish fricatives provides evidence for hybrid models of phonology]

.font120[Kamil Kaźmierski<br> Faculty of English at AMU in Poznań]

ICPhS :: August 9th, 2019
kamil.kazmierski@wa.amu.edu.pl <br>
wa.amu.edu.pl/wa/kazmierski_kamil

---
layout: true
background-image: url(theme/WA_logo_xaringan_small.png)
background-position: 2% 98%
background-size: 5%

---
class: middle, keep-h1-up, font200

# Neighborhood density vs. phonotactic probability

- Spoken-word recognition .ref[(Vitevitch & Luce 1998)]:
  - Higher neighborhood density → Slower recognition (lexical effect)
  - Higher phonotactic probability → Faster recognition (sublexical effect)
- Both lexical and sublexical effects → support for hybrid phonology .ref[(Pierrehumbert 2002)]
- Do these effects have consequences for the acoustics of speech production?

---
layout: true
background-image: url(theme/WA_logo_xaringan_small.png)
background-position: 2% 98%
background-size: 5%

---
class: font200, middle, keep-h1-up

# Neighborhood density

No. of lexemes created by removal, addition, or substitution of a single phoneme

.font90[
.pull-left[
## Low: *więc* .ipa[/vjɛnt͡s/] 'therefore'

- removal: *wiec* .ipa[/vjɛt͡s/]
- addition: Ø
- substitution: Ø
]
.pull-right[
## High: *stałe* .ipa[/ˈstawɛ/] 'constant'

- *stał* .ipa[/staw/]
- *stałeś* .ipa[/ˈstawɛɕ/], *stałem* .ipa[/ˈstawɛm/]
- *stare* .ipa[/ˈstarɛ/], *stale* .ipa[/ˈstalɛ/], *stała* <br> .ipa[/ˈstawa/], *stało* .ipa[/ˈstawɔ/]
]
]

---
class: font200, middle, keep-h1-up

# Phonotactic probability

Sum of log frequencies of diphones across the corpus .ref[(Vitevitch & Luce 2004)]

.pull-left[
## Low: *chcą* .ipa[/xt͡sɔw̃/] 'they want'

- .ipa[.ref[xt͡s]ɛ], .ipa[.ref[xt͡s]ɛʂ]
- Ø
]
.pull-right[
## High: *wie* .ipa[/vjɛ/] 'she/he/it knows'

- .ipa[.ref[vj]ɛɕtɕ, .ref[vj]ast, .ref[vj]ɔzɛ, .ref[vj]ɛm, .ref[vj]ɛmɨ], ...
- .ipa[b.ref[jɛ]da, z.ref[jɛ]m, p.ref[jɛ]s], ...
]

---
class: middle, clear

## **Research question**: Are the differential effects of phonotactic probability and neighborhood density present in *production*?

## **Motivation**: Perception-production symmetry? Language specificity? Beyond lab speech?
## **Prediction**: High neighborhood density will *increase* and high phonotactic probability will *decrease* fricative durations.

---
layout: false
class: center, middle, inverse

# Method

---
layout: true
background-image: url(theme/WA_logo_xaringan_small.png)
background-position: 2% 98%
background-size: 5%

---
class: center, font180, clear

# Data

.pull-left[
.float-vertical-align[
Greater Poland Spoken Corpus

68 Speakers; 69,720 Word tokens

[**wa.amu.edu.pl/korpuswlkp**](http://wa.amu.edu.pl/korpuswlkp)

.ref[(Kaźmierski, Kul & Zydorowicz in press)]
]
]
.pull-right[
<img src="media/map_combined_xaringan_background.png" width="80%" height="80%" style="display: block; margin: auto;" />
]

---

# Data extraction

## <img style="float:left" src="theme/praat2.png" /> Transcripts hand-aligned at breath-group level in Praat .ref[(Boersma & Weenink 2016)]

<br><hr><br>

## <img style="float:left" src="theme/labb-cat2.png" /> Corpus creation, management and querying with LaBB-CAT .ref[(Fromont & Hay 2012)]

## → Force-aligned at word and phoneme level

## → Fricative-initial (.ipa[ɕ ʂ ʑ f s v x z]) `C₁C₂V...` content words

---
class: font160, middle, keep-h1-up

# Model architecture (*N* = 2,850)

Mixed-effects linear regression with *lme4* .ref[(Bates et al. 2015)] in <svg style="height:0.8em;top:.04em;position:relative;fill:#76232F;" viewBox="0 0 581 512"><path d="M581 226.6C581 119.1 450.9 32 290.5 32S0 119.1 0 226.6C0 322.4 103.3 402 239.4 418.1V480h99.1v-61.5c24.3-2.7 47.6-7.4 69.4-13.9L448 480h112l-67.4-113.7c54.5-35.4 88.4-84.9 88.4-139.7zm-466.8 14.5c0-73.5 98.9-133 220.8-133s211.9 40.7 211.9 133c0 50.1-26.5 85-70.3 106.4-2.4-1.6-4.7-2.9-6.4-3.7-10.2-5.2-27.8-10.5-27.8-10.5s86.6-6.4 86.6-92.7-90.6-87.9-90.6-87.9h-199V361c-74.1-21.5-125.2-67.1-125.2-119.9zm225.1 38.3v-55.6c57.8 0 87.8-6.8 87.8 27.3 0 36.5-38.2 28.3-87.8 28.3zm-.9 72.5H365c10.8 0 18.9 11.7 24 19.2-16.1 1.9-33 2.8-50.6 2.9v-22.1z"/></svg> .ref[(R Core Team 2019)]

- Response:
  + `Fricative duration`
- Predictors of theoretical interest:
  + `Neighborhood density`, `Phonotactic probability` Phonological CorpusTools .ref[(Hall et al. 2018)]
- Control predictors:
  + `Average speaking rate`, `Rate deviation`, `Gender`, `Word duration`, `Frequency`, `Prefix`, `Stress`
- Random terms:
  + `(1 | Fricative)`, `(1 + Neighborhood density + Phonotactic probability | Speaker)`

---
layout: false
class: center, middle, inverse

# Results

---
layout: true
background-image: url(theme/WA_logo_xaringan_small.png)
background-position: 2% 98%
background-size: 5%

---

# Divergent effects of Neighb. dens. and Phon. prob.

<img src="index_files/figure-html/effect_plots-1.png" width="100%" height="100%" style="display: block; margin: auto;" />

---
class: middle, keep-h1-up, font200

# Conclusions

Within the same dataset:

- neighborhood density effect → Lexical level
- phonotactic probability effect → Sublexical level
- the presence of both → Hybrid model of phonology .ref[(Goldinger 2007; Pierrehumbert 2002; Ernestus 2014)]

---
class: center, middle, keep-h1-up

# Thank you!
## Neighborhood density and phonotactic probability show differential effects on the duration of Polish fricatives

<img src="index_files/figure-html/effect_plots_rep-1.png" width="75%" height="45%" style="display: block; margin: auto;" />

This research was supported by the National Science Center (Poland), grant no. UMO-2017/26/D/HS2/00027
kamil.kazmierski@wa.amu.edu.pl

---
class: middle

# Remaining fixed effects in the model

<table>
 <thead>
  <tr>
   <th style="text-align:left;"> Predictor </th>
   <th style="text-align:right;"> B </th>
   <th style="text-align:left;"> p </th>
  </tr>
 </thead>
 <tbody>
  <tr>
   <td style="text-align:left;"> Rate deviation </td>
   <td style="text-align:right;"> -0.038 </td>
   <td style="text-align:left;"> &lt; 0.001 </td>
  </tr>
  <tr>
   <td style="text-align:left;"> Average rate </td>
   <td style="text-align:right;"> -0.049 </td>
   <td style="text-align:left;"> &lt; 0.001 </td>
  </tr>
  <tr>
   <td style="text-align:left;"> Gender Male </td>
   <td style="text-align:right;"> -0.018 </td>
   <td style="text-align:left;"> 0.557 </td>
  </tr>
  <tr>
   <td style="text-align:left;"> Word duration </td>
   <td style="text-align:right;"> 0.920 </td>
   <td style="text-align:left;"> &lt; 0.001 </td>
  </tr>
  <tr>
   <td style="text-align:left;"> Frequency </td>
   <td style="text-align:right;"> 0.005 </td>
   <td style="text-align:left;"> 0.627 </td>
  </tr>
  <tr>
   <td style="text-align:left;"> Prefix TRUE </td>
   <td style="text-align:right;"> -0.117 </td>
   <td style="text-align:left;"> &lt; 0.001 </td>
  </tr>
  <tr>
   <td style="text-align:left;"> Stress TRUE </td>
   <td style="text-align:right;"> 0.078 </td>
   <td style="text-align:left;"> &lt; 0.001 </td>
  </tr>
 </tbody>
</table>
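
---
class: middle

# Sketch: computing the two predictors

The definitions of neighborhood density and phonotactic probability given earlier can be illustrated with a minimal Python sketch. It is not the procedure used in the study, where both predictors came from Phonological CorpusTools. The toy lexicon, all function names, and the use of position-independent relative diphone frequencies are assumptions made purely for illustration.

```python
import math
from collections import Counter


def is_one_edit(a, b):
    """True if phoneme tuples a and b differ by exactly one
    removal, addition, or substitution of a single phoneme."""
    if a == b or abs(len(a) - len(b)) > 1:
        return False
    if len(a) == len(b):
        # Same length: exactly one substituted phoneme.
        return sum(x != y for x, y in zip(a, b)) == 1
    short, long_ = (a, b) if len(a) < len(b) else (b, a)
    # Different length: dropping one phoneme from the longer form
    # must yield the shorter form.
    return any(long_[:i] + long_[i + 1:] == short for i in range(len(long_)))


def neighborhood_density(word, lexicon):
    """Number of distinct lexicon entries one phoneme edit away from word."""
    return sum(is_one_edit(word, other) for other in set(lexicon))


def diphone_log_probs(corpus):
    """Log relative frequency of every diphone (adjacent phoneme pair)
    over the word tokens in corpus (assumed: position-independent)."""
    counts = Counter(pair for w in corpus for pair in zip(w, w[1:]))
    total = sum(counts.values())
    return {pair: math.log(n / total) for pair, n in counts.items()}


def phonotactic_probability(word, log_probs):
    """Sum of diphone log frequencies for word.
    Assumes every diphone of word is attested in the corpus."""
    return sum(log_probs[pair] for pair in zip(word, word[1:]))


# Hypothetical toy lexicon built from the slide examples (stress marks omitted).
WIENC = ("v", "j", "ɛ", "n", "t͡s")   # więc
STALE = ("s", "t", "a", "w", "ɛ")     # stałe
LEXICON = [
    WIENC,
    STALE,
    ("v", "j", "ɛ", "t͡s"),            # wiec
    ("s", "t", "a", "w"),             # stał
    ("s", "t", "a", "w", "ɛ", "ɕ"),   # stałeś
    ("s", "t", "a", "w", "ɛ", "m"),   # stałem
    ("s", "t", "a", "r", "ɛ"),        # stare
    ("s", "t", "a", "l", "ɛ"),        # stale
    ("s", "t", "a", "w", "a"),        # stała
    ("s", "t", "a", "w", "ɔ"),        # stało
]

print(neighborhood_density(WIENC, LEXICON))  # 1 in this toy lexicon
print(neighborhood_density(STALE, LEXICON))  # 7 in this toy lexicon

# Treat the toy lexicon as a (tiny) corpus of word tokens.
LOG_PROBS = diphone_log_probs(LEXICON)
print(phonotactic_probability(STALE, LOG_PROBS))
```

Words are stored as tuples of phoneme strings so that an affricate such as .ipa[t͡s] counts as one segment. With a real lexicon the counts would be far larger, and published phonotactic measures may additionally condition on segment position or word frequency, which this sketch does not attempt.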