Journal of Child Language http://journals.cambridge.org/JCL Additional services for Journal

of Child

Language: Email alerts: Click here Subscriptions: Click here Commercial reprints: Click here Terms of use : Click here

Finding words and word structure in articial speech: the development of infants' sensitivity to morphosyntactic regularities ERIKA MARCHETTO and LUCA L. BONATTI Journal of Child Language / FirstView Article / March 2015, pp 1 - 30 DOI: 10.1017/S0305000914000452, Published online: 10 October 2014

Link to this article: http://journals.cambridge.org/abstract_S0305000914000452 How to cite this article: ERIKA MARCHETTO and LUCA L. BONATTI Finding words and word structure in articial speech: the development of infants' sensitivity to morphosyntactic regularities. Journal of Child Language, Available on CJO 2014 doi:10.1017/ S0305000914000452 Request Permissions : Click here

Downloaded from http://journals.cambridge.org/JCL, IP address: 138.251.14.35 on 21 Mar 2015

J. Child Lang., Page  of . © Cambridge University Press  doi:./S

Finding words and word structure in artificial speech: the development of infants’ sensitivity to morphosyntactic regularities* E RI K A M A R CH E T TO SISSA/ISAS, Trieste, Italy Laboratoire de Sciences Cognitives et Psycholinguistique (LSCP), Ecole Normale Supérieure, Paris, France AND

L U CA L . B O N ATT I ICREA and Universitat Pompeu Fabra, Centre for Brain and Cognition, Barcelona, Spain (Received  February  – Revised  January  – Accepted  June )

ABSTRACT

To achieve language proficiency, infants must find the building blocks of speech and master the rules governing their legal combinations. However, these problems are linked: words are also built according to rules. Here, we explored early morphosyntactic sensitivity by testing when and how infants could find either words or within-word structure in artificial speech snippets embodying properties of morphological constructions. We show that -month-olds use statistical relationships between syllables to extract words from continuous streams, but find word-internal regularities only if the streams are segmented. Sevenmonth-olds fail both tasks. Thus, -month-olds infants possess the resources to analyze the internal composition of words if the speech contains segmentation information. However, -month-old infants may not possess them, although they can track several statistical relations. This developmental difference suggests that morphosyntactic sensitivity may require computational resources extending beyond the detection of simple statistics. [*] This work was supported by grants PSI- and FVG ‘PsyScope XL’ to L.L.B, and by the Fyssen Foundation Grant to E.M. We thank A. Isaja, L. Filippin, F. Gandolfo, M. Sjekloca, K. Brink, N. Sebastián Gallés, A. Endress, K. Mehta, and J.M. Toro for scientific discussions and technical support. Address for correspondence: Luca L. Bonatti, ICREA and Universitat Pompeu Fabra, C. Roc Boronat, , Edifici Tanger, .,  Barcelona, ES. e-mail: [email protected]



MARCHETTO AND BONATTI INTRODUCTION

The multifaceted nature of word learning At a very young age, children learn to master the complex web of words and rules with which a natural language is woven. However, this is a daunting challenge. To understand the complexity it presents, consider that infants must solve two problems in a relatively short time. First, to achieve language proficiency, infants must find the words embedded in a speech stream. However, there are no clear signs of segmentation that separate words in spoken language. Second, to use language productively, they must master the system of rules and determine how the finite number of linguistic elements combines to form novel structures, by first recognizing and eventually generating new complex combinations from the words extracted from the stream. Yet the problems of finding the basic elements of language and that of identifying the structural information that makes language productive are not independent. Sentences are composed in a systematic way, and knowing how they are composed (e.g. knowing that noun phrases can be exchanged with other noun phrases, or verb phrases with other verb phrases, without loss of grammaticality) is at the root of language productivity. However, words are also composed in a systematic way. Consider the word unexpectedly. In this word, a prefix and a suffix are attached to the stem expect to modify its meaning and its syntactic category. While the internal structure of sentences is crucial to explain the productivity of language, the internal structure of words is also crucial because words can also be productive: if unexpectedly is a word, so is unhappily or unintentionally. Indeed, whether syntactic relations are realized between or within words varies dramatically among languages (Payne, ). Some languages, such as English, have a morphology that is relatively spare, whereas other languages exhibit various degrees of morphosyntactic complexity, expressing many syntactic relations directly within words. These relationships can range from basic grammatical facts, realized by agglutinating or fusing affixes with roots to convey the case, tense, gender, or number (e.g. the Italian mangio/mangia ‘I eat’/‘he/she eats’, with the last vowel encoding the person and tense, or the Italian bambina/ bambine ‘girl’/‘girls’, with the last vowel encoding the gender and number), to a wide range of grammatical and semantic properties, such as in highly agglutinative languages, which can form complex expressions with one word (e.g. inikwihlminhih’isita ‘several small fires were burning in the house’ in Nootka; Aronoff & Fudeman, ). A further complication is that syntactic relations, both when they are realized between words, such as in sentences, or within words as morphological phenomena, often occur among non-adjacent elements. This fact is evident in sentences, which require a listener to keep track of dependencies between 

FINDING WORDS AND WORD STRUCTURE

elements separated by intervening lexical items. However, the same phenomenon occurs in morphology. Several languages contain parasynthetic constructions, in which novel words are created by simultaneously adding different affixes which may fuse with a root. For example, in Italian, the word arrossito ‘blush’ is created by adding the prefix–suffix couple ad (+ phonological regressive assimilation with r) and ito to the root ross. Because lexical items, such as arrosso or rossito, do not exist, the prefix– suffix combination is applied simultaneously to the root; hence, this creates a non-adjacent relationship between syllables within the word. This diffuse and productive construction in Slavic and Romance languages (Vilares, Cabrero & Alonso, ; Bisetto & Melloni, ; Lavale Ortiz, ) is also present in English, as the words downhearted, embolden, or able-bodied exemplify.

Words, word structure, and distributional information The examples above show that even the first steps into the construction of a lexicon contain several layers of complexity, spanning from the need to identify words inside a continuous speech stream to the search for the internal structure of words. An infant has to cope with all of these factors when analyzing the unpredictable and difficult-to-interpret speech stream produced around her. This input does not allow her to clearly separate the search for words and the search for word structure because words often contain syntactic information: the problems of finding words and finding rules within words must be solved together. This paper investigates the origin of these abilities in infants. Specifically, we focused on when and under what conditions infants become sensitive to within-word relations, possibly requiring the establishment of non-adjacent relations between word syllables. We focused on - and -month-old infants. These ages are interesting to consider for several reasons. At approximately  months of age, infants can already track various distributional properties to organize linguistic and non-linguistic continua. Notably, they can use the conditional frequencies with which syllables strictly follow each other (that is, their ‘transitional probabilities’; hereafter, TPs) to begin segmenting a continuous stream into word-like units. They can exploit distributional information both when exposed to artificial speech (Saffran, Aslin & Newport, ; Aslin, Saffran & Newport, ) and when exposed to real speech (Pelucchi, Hay & Saffran, ), suggesting that TP computations could indeed be involved in real-world language acquisition. Eight-month-olds can also identify morphemes and functional parts of a language, which is potentially useful to start decoding the language’s morphosyntax (Shi, Werker & Cutler, ; Gervain, Nespor, Mazuka, Horie & Mehler, a; Shi & Lepage, ). 

MARCHETTO AND BONATTI

These abilities could help in solving the problem we investigate in the current paper. However, as we discussed above, infants need to learn much more than unanalyzed syllable chunks to achieve the language competence typical of even very young toddlers. They need to extract structure from tokens. To do this, they must track relations within subcomponents of words, separating stems from the productive parts of the lexicon. At approximately  months of age, infants begin the process of consolidating basic elements of the natural language they will soon speak, beginning their journey from organized perception to organized production. Around their first birthday, they have tuned their discrimination abilities to the sound patterns of their language (Werker & Tees, , ); they can identify language-specific functors (Shi et al., ), they begin building their lexicon deploying word-specific learning strategies (Stager & Werker, ; Halberda, ; Johnson, Jusczyk, Cutler & Norris, ), and they start being sensitive to how grammatical properties of words determine their syntactic roles (Hirsh-Pasek & Golinkoff, ). Between  and  months of age, infants also show the ability to recover structural/ grammatical information from natural and artificial speech, both when such information is contained in a sentence and when it is contained inside a word. Thus, when exposed to an artificial language, -month-old German infants can categorize a novel word as a content word if it follows a determiner, suggesting a skilled ability to identify parts of the signal carrying grammatical information even before the development of an extended lexicon (Höhle, Weissenborn, Kiefer, Schulz & Schmitz, ). At approximately the same age, infants can analyze the morphosyntactic composition of some natural language words. Mintz () exposed -month-old infants to naturally pronounced pseudo-words containing either a real natural language affix (such as the English -ing in the pseudo-word Gorping), or an affix non-existing in their language (such as -ot in Gorpot). Infants only needed to hear a few pseudo-words containing the real suffix to process it as a separate entity from the potential root Gorp, although the root and suffix were heard as agglutinated in a single lexical item. However, this capacity may still not be fully functional to identify morphosyntactic patters requiring infants to track non-adjacent relations. For example, even if infants can represent -ing as a suffix as of  months, before  months they may not be able to track non-adjacent dependencies involving such a suffix, such as those existing between auxiliaries and verb endings (is . . . -ing; Santelmann & Jusczyk, ). Finally, under certain conditions, infants can find arbitrary relations at a distance among artificial tokens composing a potential sentence. Gómez and her collaborators familiarized infants with sets of separate pseudo-words in which the items were connected by either adjacent relations (e.g. alt comoo, alt kicey, . . .) or non-adjacent relations (e.g. pel wadim rut, pel kicey rut, . . .). 

FINDING WORDS AND WORD STRUCTURE

At approximately  to  months of age, infants could exploit both types of dependency (Gómez & Gerken, ; Gómez, ; Gómez & Lakusta, ; Gómez & Maye, ), but only when they were exposed to a sizable measure of variability for the intervening item. While these last results seem to suggest that non-adjacent relations cannot be easily captured before  months, infants in these experiments were familiarized with sets of separate words. By contrast, the relations typical of many morphological processes are established within words. Perhaps the ability to detect between-word and within-word non-adjacent relations appear together, but this is not known. In sum, the potentialities to process word and word structure are available early, but we still do not have a clear picture of when and under what input conditions infants start to find structures within words, thus becoming capable of identifying the regularities typical of morphosyntactic processes.

The onset of sensitivity to word structure: the current research At the early stages of speech production, children show a mastery of a wide array of relatively abstract syntactic and morphosyntactic properties. For example, when children begin producing two-word sentences between · and · years of age, they respect the order of their native language; this is a sign indicating mastery of the Head Direction Parameter (Guasti, ). They use order information to interpret novel sentences, even at the cost of making errors (Gertner & Fisher, ). At the same time, when they speak morphosyntactically rich languages, such as Italian (Pizzuto & Caselli, ; Guasti, ; Caprin & Guasti, ), Spanish or Catalan (Torrens, ), toddlers and children produce sentences by respecting morphological regularities, such as subject–verb or tense and case agreements, which imply a mastery of parts of the morphosyntax. Thus, by the onset of linguistic production, infants analyze words as being composed of stems and affixes. However, as discussed above, the resources to identify words in the speech stream are available earlier. What happens between the periods when infants start tracking statistical relations and when they start producing words? Specifically, at what point and via what resources do infants begin organizing their lexicon systematically? In learning a lexicon that in a few months will grow spectacularly, infants would greatly benefit from grasping word structures as they are building their native language vocabulary. Thus, it is reasonable to suppose that as early as they begin focusing on their native language and can create the appropriate phonological representation to parse sounds into words, infants may attempt to also parse the internal structure of the words they begin acquiring. 

MARCHETTO AND BONATTI

Artificial speech as a tool to investigate natural speech processing Our experiments aimed to probe the cognitive precursors grounding the ability to learn morphosyntax, by testing when infants become sensitive to word-internal structure and under what input conditions this can be accomplished. We studied how infants learn within-word relations after exposure to artificial language streams. While extrapolations from artificial to natural language learning must be performed with care, artificial language learning offers several advantages. It is possible to control a series of parameters, such as the prosody, speech rate, or previous exposure to the stimuli, which may allow us to better understand aspects of language acquisition that are difficult to capture with natural language. Furthermore, a consistent set of studies has found that the processes triggered during exposure to artificial speech are not very different from those active when participants listen to natural speech. First, neural evidence suggests that the brain areas and the connectivity patterns elicited by artificial speech largely overlap with those activated while listening to natural speech (De Diego-Balaguer, Toro, Rodriguez-Fornells & BachoudLevi, ; de Diego-Balaguer & Lopez-Barroso, ; Lopez-Barroso, de Diego-Balaguer, Cunillera, Camara, Münte & Rodriguez-Fornells, ; López-Barroso, Catani, Ripollés, Dell’Acqua, Rodríguez-Fornells & de Diego-Balaguer, ) both in adults and children (McNealy, Mazziotta & Dapretto, , ). Second, in several cases, the processes involved in artificial language learning have been shown to mimic processes involved in natural language acquisition. This has been observed both in word learning and in tasks aimed at studying sensitivity to syntactic structure. Thus, while calling a syllable sequence extracted from an artificially synthesized stream a ‘word’ may stretch the meaning of the word ‘word’, under appropriate conditions, infants are willing to accept such sequences as meaning-carrying sounds (Lany & Saffran, ). Likewise, -month-old infants prefer to listen to snippets of artificial speech which are coherent with the distribution of functors and content words in their natural language (Gervain et al., a; Yoshida et al., ). Importantly, such preference can be elicited simply by familiarizing infants to sequences in which the item frequency mimics the distribution of functors and content words in their natural languages, suggesting that even a very brief exposure to artificial speech elicits the same processes of analysis that infants naturally apply to real speech. Even in the more complex case of bilingual infants learning languages with opposite word orders, artificial speech can be used to reveal learning processes by manipulating the pitch, prosody, and frequency of artificial streams (Gervain & Werker, a). Similar phenomena occur with adult participants (Gervain et al., ). 

FINDING WORDS AND WORD STRUCTURE

These findings and other reasons suggest that the learning processes induced by artificial speech may allow researchers to exercise a good control of stimuli while tapping into processes potentially involved in real morphosyntactic acquisition. The current research is based on this speculative, but grounded, assumption.

AXC languages as a tool to study early sensitivity to the morphosyntactic structure of words in an artificial speech stream Several experiments in artificial language learning have shown that when adult participants are exposed to artificial speech streams that contain even minimal cues of segmentation, they can separate the streams into word-like items; furthermore, they are quickly capable of extracting the structure of these items, even when it is defined by internal, even non-adjacent, word subcomponents, such as separate syllables within the same words. However, when the stream is continuous and segmentation cues are absent, adults are unable to detect the rules of word composition, even if they can still extract words from the stream (Peña, Bonatti, Nespor & Mehler, ; Bonatti, Peña, Nespor & Mehler, ; Endress & Bonatti, ; Toro, Nespor, Mehler & Bonatti, ). For example, Peña et al. () exposed adults to a continuous stream of trisyllabic sequences characterized by internal, non-adjacent TPs of · and a varying middle syllable (for brevity, an ‘AXC language’). The stream could contain sequences such as Puliki, Puraki, or Pufoki, mixed in a continuum of other syllables. The stream was designed so that neither the syllable frequency nor the adjacent TPs were informative to identify words and their structure. However, by computing non-adjacent TPs among syllables, it was possible to accomplish both tasks. In Peña et al.,’s experiments, after exposure to a continuous stream, adults could find the words, but failed to grasp their structure, despite the fact that the same computation may have sufficed to identify it. In contrast, the ability to project regularities about the internal structure of words appeared if participants were familiarized with a segmented stream, even after very brief exposures. In these experiments, the segmentation cues were short pauses inserted between words, but the specific segmentation marks are not crucial. Prosodical phrase markers (Shukla, Nespor & Mehler, ) or final vowel lengthening (Saffran, Newport & Aslin, ) work equally well. Indeed, insofar as learners find signs marking word boundaries, they seem to be able to grasp the internal word structure (Langus, Marchetto, Bion & Nespor, ). The AXC languages designed by Peña et al. () are suitable for investigating the onset and the development of sensitivity to morphosyntax. Such languages contain both statistically coherent units that can be identified by computing TPs between adjacent or non-adjacent syllables 

MARCHETTO AND BONATTI

(henceforth, ‘words’) and morphosyntactic information because the words are characterized by a common internal structure. In the following experiments, we adapted Peña et al.,’s AXC streams to - and -month-old infants. We took as granted that infants could compute adjacent TPs among syllables in artificial languages, because there is evidence that they can do it even at  months of age (Thiessen & Erickson, ). The issue we explore is: What is the developmental course of the simplest next step necessary to break into word structure? Our strategy was to present infants with the easiest contrast that may reveal sensitivity to morphosyntax, asking them to differentiate between ‘words’, or ‘rule-words’, and ‘non-words’. Here, ‘words’ are sequences of three syllables characterized by non-null, but not perfect, adjacent TPs and perfect non-adjacent TPs. ‘Rule-words’ are sequences of three syllables obtained by replacing the middle syllable of words with another syllable. Thus, rule-words never occur in the stream, but maintain the same relations as words among non-adjacent syllables. ‘Non-words’ are sequences of three syllables that never occur in the stream in that sequence and do not maintain the same relations among adjacent or non-adjacent syllables as words. While, by design, such comparisons fall short of addressing many issues related to the origin of morphosyntactic abilities, they have a marked advantage over more difficult comparisons. They test the minimal computations that infants must be able to do to compare items with and without internal structure in the easiest possible way. Thus, successes, as well as failures, are informative about the onset of morphosyntactic abilities. If infants at a given age fail a relatively easy contrast involving a single small step moving beyond adjacent TPs towards the identification of word structure, then they most likely do not possess the computational resources to analyze morphosyntactic patterns. Conversely, a success is an indication that when listening to unknown speech streams containing cues to word segmentation, they do not limit themselves to memorizing token words, but they try to parse their internal structure. We present a series of experiments testing the abilities necessary to differentiate words or rule-words from non-words after exposure to either segmented or continuous artificial speech streams. In Experiments –, we tested sensitivity to the words of an artificial language and to their morphosyntax, after exposure to either a segmented stream (Experiment ) or a continuous stream (Experiments  and ) in -month-old infants. If, at approximately one year, infants begin developing an awareness of the morphosyntax of their crescent lexicon, then we predicted that, similar to adults, they should be able to identify rules within words after exposure to a segmented stream (Experiment ), but not to a continuous stream (Experiment ), even if they can identify the words of the familiarization language (Experiment ). In Experiments –, we assessed to what extent -month-old infants could find the morphosyntactic properties of words in 

FINDING WORDS AND WORD STRUCTURE TA B L E

 . An overview of the experiments and their p-values

Age

Contrast tested

Stream kind

Experiment

p-value

 months

Words vs. Non-words AXC Rule-words vs. Non-words

Continuous Segmented Continuous

 , Supp. Materials 

4· 4.; 4· 5.

 months

Words vs. Non-words

Segmented Continuous Segmented

  

4· 5· 5.

AXC Rule-words vs. Non-words

the same languages. At that age, infants have not completed the process of perceptual reorganization needed to properly represent a language-specific lexicon (Werker & Tees, ). Thus, if an advanced process of convergence towards a native language is a prerequisite for the construction of a structured lexicon, then -month-old infants might not be sensitive to within-word relations despite the fact that they can compute adjacent relations among syllables (Saffran, Aslin, & Newport, ) and could, in principle, identify abstract relations between isolated tokens (Marcus, Vijayan, Rao & Vishton, ). Table  contains a summary of the structure of the experiments and their results. EXPERIMENT  Participants Thirty-two infants from Italian-speaking families, aged ; (M = ;·; Range = ;·–;·) with a minimum APGAR of  and no hearing or vision problems, were included in the analysis ( girls). Twenty-three additional infants were excluded from the analyses ( for fussiness, and  because s/he looked longer than  cumulative s in more than two test trials, on the assumption that such infants were sticky-fixating on our attractor stimuli instead of responding to the test sounds). The criteria for participant selection were identical in all reported experiments. Such a rejection rate is high, but not uncommon. Slaughter and Suddendorf () ran a meta-analysis of infant experiments reporting rejection rates ranging from  to %. These authors showed that such variation does not systematically influence experimental outcomes. Stimuli and procedure We created two artificial speech streams (A and B) by pseudo-randomly concatenating four continuous sequences of three Consonant–Vowel (CV) syllables, with the constraint that one word could not occur twice in a 

MARCHETTO AND BONATTI

 . Items used to compose the familiarization stream and the test phase of Experiment  (the bold face in Rule-Words and Non-Words identifies syllables that appear in the same positions as in the words of the respective familiarization streams) TA B L E

WORDS

RULE-WORDS

NON-WORDS

Stream A

Stream B

Stream A

Stream B

Stream A

Stream B

/bamuso/ /bagaso/ /limufe/ /ligafe/

/feliga/ /febaga/ /solimu/ sobamu/

/baliso/ /bafeso/ /libafe/ /lisofe/

/fesoga/ /femuga/ /sogamu/ /sofemu/

/sogali/ /femuba/ /bafemu/ /lisoga/

Identical to the Rule-Words of Stream A

row. The languages contained two word pairs with identical first and last syllables, differing only in their middle syllable (see Table ). We refer to these pairs as ‘word families’. Thus, the languages could be described as composed of two AXC word families characterized by minimal variability. In each stream, words were repeated  times. TP was  between the first and last syllables of each word and · between adjacent syllables. Adjacent TPs across word boundaries were also ·; thus, adjacent TPs, while non-null, could not be informative about word boundaries. Each trisyllabic sequence was separated from the next by a  ms silence. Test items were either non-words (i.e. sequences of three syllables that never occur in the stream in that sequence) or rule-words (i.e. sequences of three syllables obtained by replacing the middle syllable of words with another syllable). The non-adjacent probability relations among syllables in the two streams were such that the rule-words of Stream A became the non-words of Stream B. To realize this inversion, in Stream A, two non-words had CBA’ structure (where C and B stand for the third and second syllable of one family, and A’ stands for the first syllable of the other family), and two non-words had an AC’B structure, whereas in Stream B, the non-words had a BB’A or a BAA’ structure (Table ). In either stream, non-words had  frequency, as well as both adjacent and non-adjacent  TPs, whereas rule-words had  frequency and  adjacent TPs, but  non-adjacent TP. The material was synthesized with the Mbrola speech synthesizer (Dutoit, Pagel, Bataille & Vreken, ) using the FR- database, setting flat prosody,  ms phoneme length and  Hz pitch. Mean word length was  ms. The familiarization stream was synthesized with increasing and decreasing amplitude in the first and last  s, respectively, to avoid providing direct cues to word onsets. The stream lasted  min  s. In a modified version of the headturn preference procedure (Marchetto & Bonatti, ), infants sat on their caretaker’s laps in a dimly lit, quiet room 

FINDING WORDS AND WORD STRUCTURE

with three monitors positioned at their front and sides. The caretakers listened to masking music and were instructed not to interact with the infants during the experiment. During familiarization, a visual stimulus (a movie of a moving hand) attracted the infants towards the center while the speech stream played. In the test phase, two trials for each of the eight test stimuli were presented in a pseudo-random order, with the constraints that the same item could not be repeated in the immediately succeeding trial and that no more than three items of the same type could occur in a row. In each trial, the attractor appeared at the center first. After · s of continuous fixation, it disappeared and reappeared on one of the side monitors. As infants oriented towards it, the test item started playing repeatedly from loudspeakers hidden behind the monitors until infants looked away for  s consecutively or looked for more than  s cumulatively. Test items were repeated after a  ms ISI. An Apple G controlled by PsyScope X ran the experiment (http://psy.cns.sissa.it; Cohen, MacWhinney, Flatt & Provost, ). Infants’ looking behavior was recorded by a camera hidden behind the center monitor. The camera also allowed the experimenter to control the experimental procedure on-line; thus, the test trials began and ended contingent upon the infant’s looking behavior. Looking time was successively coded off-line. Looking times shorter than  s or  SD beyond the general mean computed for each test item type were excluded from further analysis. Across all experiments, excluded data ranged between ·% and ·%. Results and discussion We ran a mixed-design, repeated-measure ANOVA, with Stream (A or B) as a between-participant factor, test item type (Non-words vs. Rule-words) as a within-participant factor, and participants as a random factor nested in Stream. Infants looked longer while listening to non-words than to rule-words (Figure ; MNon-words = · s, SE = ·; MRule-words = · s, SE = ·, F(,) = ·, p 4 ·). There was no effect of stream (F(,) = ·, p = ·) and no interaction between stream and test item type (F(,) = ·, p = ·). Thus, infants responded to the nature of the test stimuli as rule-words or non-words, despite the changes in familiarization stream, test items, and non-word structure. These results suggest that phonotactic, phonetic, or phonological features of the test items or of the familiarization stream were not determining factors in the infants’ looking behavior. Both infants (Mattys, Jusczyk, Luce & Morgan, ) and adults (Peña et al., , footnote ; Onnis, Monaghan, Richmond & Chater, ) are sensitive to such information and can use it to segment and learn words (Gonzalez-Gomez & Nazzi, ; Gonzalez-Gomez, Poltrock & Nazzi, 

MARCHETTO AND BONATTI

12

*

*

Looking time (s)

10 8 6 Rule-Words Non-Words

4 2 0

Stream A

Stream B

Fig. . Results of Experiment . Mean looking time (s) and SE for test items in -month-old infants listening to rule-words/non-words after a segmented familiarization (Experiment ). Streams A and B are structurally identical, but the statistical relations among their syllables are such that the non-words in Stream A become the Rule-Words in Stream B.

). Nevertheless, these factors alone could not explain the results of Experiment . Further, simple monitoring of bigrams in the familiarization stream could not account for the results. Both in non-words and in rule-words, the bigram frequency (that is, the frequency of co-occurrence of two adjacent syllables) was . Because of the construction of the material in Streams A and B, Experiment  excludes that infants responded by only monitoring differences between test items and either the first or second syllable of familiarization words. Indeed, two of the non-words in Stream B had structure CBA’ and two had structure AC’B. Thus, in two of the four nonwords, the initial syllable was identical to that of words (and hence of rulewords). In the other two non-words, the initial syllable was not identical to that of words, but the middle syllable was. Infants did not react differently to the two types of non-words, suggesting that they did not monitor one single syllable in either initial or middle position. However, Experiment  does not exclude that infants responded by only monitoring differences in the last syllable of test items. To control for this possibility, we ran a control experiment reported in the Supplementary Materials. We maintained the familiarization and rule-words from Stream A of Experiment , but we created two novel non-words with BAC’ structure, with a last syllable identical to that of words. Even in this case, infants looked longer while listening to non-words than to rule-words, and did not react differently to the two types of non-words, suggesting that infants do not respond to simple 

FINDING WORDS AND WORD STRUCTURE

mismatches between test items and words in familiarization in one single position (see Supplementary Materials). In Experiment , neither rule-words nor non-words appeared during familiarization. Therefore, infants’ preference for non-words may indicate that -month-olds learned the structural regularity underlying word structure and generalized it to novel sequences containing it. The results of Experiment  indicate that -month-olds possess the resources to grasp morphosyntactic regularity, even when exposed to little variation and few exemplars of it. They suggest learning strategies consistent with the ‘less is more’ hypothesis (Newport, ). Specifically, for the onset of morphosyntax, they also document the presence of one crucial tool to analyze within-word relations (i.e. tracking multiple positional variations), which may be their entry point into the morphosyntax of their lexicon. Peña et al. () showed that adults can find words in a continuous speech stream on the basis of non-adjacent relations between syllables, but can generalize to their structure only if the stream is segmented. In Experiment , infants exposed to segmented streams succeeded in finding the morphological-like regularity underlying words. If infants respond to the same stimulus properties as adults, then when they are familiarized with a continuous stream, they may fail to find differences between rule-words and non-words, even when they can distinguish words from non-words. This pattern of results would strongly suggest that infants analyze a speech stream by responding to the same structural properties as adults, thus possibly using the same mechanisms. In the next two experiments, we exposed infants to streams with the same statistical properties of Experiment , but without pauses between words. We then tested their ability to discern either the contrast rule-words/non-words (Experiment ) or the contrast words/non-words (Experiment ).

EXPERIMENT  Participants, stimuli, and procedure Sixteen infants, aged ; (M = ;·; Range: ;·–;·), were retained for analysis ( girls). Seven additional infants were excluded ( fussed;  exceeded maximum looking criteria;  for equipment failure). Because Streams A and B did not yield different results in Experiment , we based this experiment on Stream A and its test items. We synthesized a new stream containing the same syllable sequences of Stream A, but we eliminated the pauses marking segmentations between the words. The stream was  min  s long. Infants were tested with the same rule-words and non-words as Experiment , Stream A (see Table ). All other aspects of the procedure were identical to Experiment . 

MARCHETTO AND BONATTI

12

*

*

Looking time (s)

10 8 6 Rule-Words Non-Words Words

4 2 0

Segmented (Exp.1 Stream A)

Continuous (Exp.2)

Continuous (Exp.3)

Familiarization Type

Fig. . Results of Experiments  − . Mean looking time (s) and SE for test items in -month-old infants listening to rule-words/non-words (Experiment , continuous familiarization) or to words/non-words (Experiment , continuous familiarization). For comparison, the graph reports the results of Experiment , Stream A (segmented familiarization), from which the familiarization of Experiment  was constructed by removing the segmentation indexes.

Results and discussion Infants looked equally to both test items (MNon-words = · s, SE = ·; MRule-words = · s, SE = ·, F(,) = ·, p = ·; Figure ). Thus, unlike Experiment , they failed to extract any structural information after familiarization to a continuous stream. To further validate the different reactions between exposure to a segmented or a continuous stream, we ran common analyses of the infants who were exposed to Stream A in Experiments  and  with and without segmentation marks, respectively, and were exposed to the very same test items. A mixed-design, repeated-measure ANOVA, with the Experiment ( or ) as a betweenparticipant factor, the test item type (Non-words vs. Rule-words) as a within-participant factor, and the participants as a random factor nested in Stream showed no main effect of Experiment or test item type, but a strong tendency towards an interaction of the two factors (F(,) = ·, p 4 ·). Scheffé post-hoc analyses showed that, as reported, infants exposed to a segmented stream looked longer at non-words than at rule-words (p 4 ·), but infants exposed to a continuous stream did not (p = ·). Experiments  and  suggest that infants need segmentation cues to capture within-word generalizations. Only when they were exposed to a segmented stream could they develop a sense of the structure underlying 

FINDING WORDS AND WORD STRUCTURE

the words in the AXC language, similar to adults (Peña et al., ; Bonatti et al., ; Endress & Bonatti, ). However, it is also possible that in Experiment  infants failed to manifest a differential behavior when listening to the rule-words and non-words during the test phase, not because they could not find the word-internal structure, but because they could not compute anything from the continuous stream to which they were exposed during familiarization. If so, then after the same familiarization, they should also fail to differentiate words from non-words. Experiment  tested this alternative explanation. EXPERIMENT  Participants, stimuli, and procedure Sixteen infants, aged ; (M = ;·; Range: ;·–;·), were retained for analysis ( girls). Eight additional infants were excluded because they fussed. Infants were familiarized with the same continuous stream from Experiment , but they listened to words and non-words in the test phase. The experiment was otherwise identical to Experiment . Results and discussion Infants looked longer when listening to non-words than to words (MNon-words = · s, SE = ·; MWords = · s, SE = ·, F(,) = ·, p 4 ·; Figure ), indicating that -month-olds are sensitive to statistical relations among syllables after familiarization with a continuous stream. This result can be interpreted both as evidence that, after being familiarized to a continuous stream, infants could exploit the within-word non-adjacent TPs of , or that they could exploit TP differences as low as ·. According to this second possibility, infants may succeed because they note that words have · adjacent TPs, but non-words have  adjacent TPs. While this account is possible, it is unlikely for several reasons. First, there is currently no clear evidence showing that infants can segment elements out of a continuum by exploiting low TPs. Aslin et al. () did show that infants at  months can differentiate trisyllabic sequences with TPs of  from sequences with TPs of ·. However, the demands of the current task are different. In Experiment , it is the highest TP level holding groups of syllables together that is at ·. Thus, items favored by a possible computation of adjacent TP relations have a very high level of noise. In contrast, in Aslin et al. (), words were clearly identified by noiseless TPs of  among adjacent syllables. Perhaps infants of this age cannot tolerate a high level of noise when computing adjacent TPs. Second, because both word-internal and cross-word TPs in the stream are ·, computing only adjacent TPs would not allow infants 

MARCHETTO AND BONATTI

to single out any particular syllable triplet. Hence, the test words should appear as arbitrary as non-words, unless we attribute to infants the ability to retain a very long string from which they can recall uniform snippets and compare them to other syllable combinations. This ability seems even harder than computing non-adjacent relations among syllables. Thus, we suggest that success at Experiment  may be due to the computation of the perfect TPs of  among non-adjacent syllables. Taken together, Experiments – show that -month-olds find words and word-internal structure in the same conditions as adults: when exposed to a segmented stream, they appear to capture word-internal structure (Experiment ), but when the stream is continuous, they are unable to extract this information (Experiment ), even though they may find its words (Experiment ). As recalled, Mintz () showed that -month-old infants can identify affixes of their natural languages, suggesting that by that age infants can begin morphosyntactic analysis of a natural language lexicon. We found that at approximately one year of age, when infants are exposed to an artificial speech stream, they can already deploy the computational resources to extract the morphosyntactic properties of its words, provided that the words have already been segmented out of a continuum. In the next experiments, we explored the onset of morphosyntactic sensitivity. We exposed -month-olds to the same segmented artificial streams with which -month-olds were familiarized, and we tested their abilities to differentiate rule-words or words from non-words. This age comparison is interesting for several reasons. At  months, infants can detect adjacent TPs between differing syllables in a continuous stream (Saffran, Newport, & Aslin, ); they can detect some rule patterns when familiarized to a set of discrete items (Marcus et al., ); and they begin to spot words in natural speech (Jusczyk & Aslin, ; Bergelson & Swingley, ). However, the process of perceptual convergence towards the phonological repertoire of their native language is not complete. Therefore, the lexical representations they can construct are necessarily partial. It is an interesting question whether such representations are solid enough to incorporate some structural properties of the lexical items. Seven-month-olds have all the necessary resources, when considered separately, to do such an analysis, but they may not yet be able to use their computational abilities in a conjoined way, or they may not feel the pressure coming from the emergence of a natural language system. If all that is needed to discover within-word relations are general statistical abilities, then they should succeed in our tasks, possibly under the same exposure conditions as -month-olds. However, if sensitivity to within-word structure develops only after infants are well advanced into the process of convergence towards their native language, then -month-olds may not attend to the internal 

FINDING WORDS AND WORD STRUCTURE

12

* Looking time (s)

10 8 6

Rule-Words Non-Words Words

4 2 0

Segmented (Exp.4)

Segmented (Exp.5)

Continuous (Exp.6)

Familiarization Type Fig. . Results of Experiments  − . Mean looking time (s) and SE for test items in -month-old infants listening to rule-words/non-words (Experiment ) and to words/ non-words after a segmented (Experiment ) or continuous (Experiment ) familiarization.

construction of word-like items, despite their proven abilities at capturing statistical relations in sounds and other stimuli. We first asked if -montholds can find word structure after exposure to a segmented stream, similar to the -month-olds in Experiment . EXPERIMENT  Participants Thirty-two infants, aged ; (M = ;·; Range: ;·–;·), were included in the analysis ( girls). An additional  infants participated, but were excluded ( fussed,  exceeded maximum looking criteria). Stimuli, procedure, and results The experiment and the data analyses were identical to Experiment . Infants showed no tendency to look differentially at rule-words or non-words (Figure ; MNon-Words = ·, SE = ·, MRule-Words = ·, SE = ·, F(,) = ·, p = ·). Furthermore, there was no effect of stream type (F(,) = ·, p = ·), and there was no interaction between the two factors (F(,) = ·, p = ·). To test whether -month-olds and -month-olds behaved differently, we ran common analyses of the -month-olds in Experiment  together with the -month-olds in the current experiment. A mixed-design, 

MARCHETTO AND BONATTI

repeated-measure ANOVA, with Age ( or  months) as a between-participant factor, the test item type (Non-words vs. Rule-words) as a within-participant factor, and the participants as a random factor nested in Age showed no main effect of Age (F(,) = ·, p = ·). There was a main effect of test item type (F(,) = ·, p 4 ·) but also an interaction between Age and Test Item Type (F(,) = ·, p 4 ·). Exploring the interaction, Scheffé post-hoc analyses showed that -month-olds looked significantly longer at non-words than at rule-words (p 4 ·), but that -month-olds did not (p = ·). That is, the main effect of Test Item Type was entirely due to the differential behavior when listening to rule-words or non-words in the -month-olds, whereas the -month-olds did not contribute to it. The results suggest that, at  months, the computational resources to find within-word structural relations are not yet available or are not deployed for this purpose. However, as may happen with a novel design yielding null results, it is also possible that -month-olds did not respond to it. Perhaps they can extract information about the structure of artificial words present in a segmented stream, but our design was not sensitive enough. We thus tested whether -month-olds could find words, rather than word structure, when tested with our design, familiarizing them to the streams of Experiment . If infants succeeded, then their failure in Experiment  may be specifically related to their inability to detect word structure. EXPERIMENT  Participants, stimuli, and procedure Thirty-two infants, aged ; (M = ;·; Range: ;·–;·), were retained for analysis ( girls). Seventeen additional infants were excluded ( fussed,  exceeded maximum looking criteria). The experiment was identical to Experiment , with the crucial difference that test items were words and non-words. Results and discussion The data analyses were identical to Experiments  and . Infants looked longer at non-words than at words (MNon-Words = · s, SE = ·; MWords = · s, SE = ·, F(,) = ·, p 4 ·). There was a main effect of Stream (MStreamA = · s, SE = ·; MStreamB = · s, SE = ·, F(,) = ·, p 4 ·). Importantly, however, the Stream factor did not interact with the Test Item Type (F(,) = ·, p = ·). Post-hoc tests with the Scheffé method showed that infants in both familiarization groups looked longer at non-words (MWords-StreamA = · s, SE = ·, MNon-Words-StreamA = · s, SE = ·, p 4 ·); MWords-StreamB = · s, SE 

FINDING WORDS AND WORD STRUCTURE

= ·, MNon-Words-StreamB = · s, SE = ·, p 4 ·). We attribute the main effect of Stream, which we found in no other experiment, to group differences. The results show that -month-olds did process the information contained in the segmented familiarization streams used in Experiments  and . They can find words in these streams. Hence, the failure to find rule-words in Experiment  can be interpreted as a specific lack of sensitivity to within-word relations. To better understand the -month-olds’ computations, we explored what cues they may use to identify the words in the stream of Experiment . They could be exploiting distributional information, such as adjacent TPs between syllable bigrams or non-adjacent syllable TPs, or they could use the segmentation marks separating words. Experiment  probed the causes of the infants’ success in Experiment . If they computed distributional information, then infants should also be able to extract words from a continuous stream because the absence of pauses does not modify the statistical relations among syllables. Instead, if they found words by exploiting the segmentation marks between them, then, when these marks are removed, -month-olds should fail to find the same words they successfully found in Experiment . EXPERIMENT  Participants, stimuli, and procedure Thirty-two infants, aged ; (M = ;·; Range: ;·–;·), were included in the analysis ( girls). Nineteen additional infants were excluded ( fussed,  exceeded maximum looking criteria,  for equipment failure). The procedure and data analyses were identical to Experiment , with the difference that the familiarization streams had no pauses separating words. Results Infants showed no tendency to look differentially at words or non-words (MNon-Words = ·, SE = ·, MWords = ·, SE = ·, F(,) = ·, p = ·). There was no effect of stream (MStreamA = · s, SE = ·; MStreamB = · s, SE = ·, F(,) = ·, p = ·), and no interaction between the two factors (MStreamA = · s, SE = ·; MStreamB = · s, SE = ·, F(,) = ·, p = ·). To compare the effect of a continuous or a segmented familiarization on the ability to tell words and non-words apart at  months, we ran common analyses of the participants in Experiments  and . A mixed-design, repeated-measure ANOVA, with the Familiarization Stream (with/without pauses) as a between-participant factor, the test item type (Non-words vs. Rule-words) as a within-participant factor, and the participants as a random factor nested in the Familiarization 

MARCHETTO AND BONATTI

Stream showed no main effect of the Familiarization Stream (F(,) = ·, p = ·). Infants in both experiments looked equally longer at test items regardless of whether they were exposed to a continuous or a segmented familiarization. The test item type did have an effect (F(,) = ·, p 4 ·). However, the interaction between Familiarization Stream and Test Item Type was also significant (F(,) = ·, p 4 ·). Scheffé post-hoc analyses showed that only those infants exposed to a segmented stream looked longer at non-words than at words (p 4 ·). By contrast, those infants exposed to a continuous stream did not (p = ·). Thus, apparently, being exposed to a segmented stream was necessary for infants to tell words apart from non-words.

Discussion Surprisingly, infants failed to segment words from the continuous stream. They could not segment them out, even if the words were statistically favored with respect to non-words, having perfect non-adjacent TPs of  (against ), adjacent TPs of · (against ), and a higher bigram frequency. This failure is particularly informative. Seven-month-olds can track adjacent syllable co-occurrence in a continuous speech stream when their TPs are perfect (Saffron, Newport, & Aslin, ; Aslin et al., ). They are also sensitive to bigram frequency (Aslin et al., ). The current result suggests that, in our case, infants could not compute any of these measures, suggesting that -month-olds’ abilities at tracking statistics in the signal are rather weak and are likely limited to very high adjacent relations. Indeed infants could not even exploit multiple statistical redundancies in the signal. The negative result of Experiment  allows us to better interpret what -month-olds did in Experiment , where they succeeded at differentiating words from non-words. It suggests that the infants succeeded not because they computed any sort of statistical relation among syllables but most likely because the segmentation marks in the familiarization streams provided a grouping advantage for words over non-words. Once these marks were removed, the infants were unable to detect a difference between test items. Together, Experiments  and  suggest that infants’ first headway into a lexicon may be particularly influenced by words heard in isolation (e,g., Brent & Siskind, ; Lew-Williams, Pelucchi & Saffran, ), and utterance boundaries or other prosodic segmentation cues (e.g. Shukla et al., ), rather than by the computation of statistical relations from natural language streams. Indeed, these relations, in a natural context, are often well below the perfect or quasi-perfect TPs between adjacent items (Yang, ) that appear to be the limit of computational abilities in infants. Experiments , , and  also allow us to clarify the results obtained with the -month-olds. Experiments – showed that -month-olds reacted 

FINDING WORDS AND WORD STRUCTURE

differently to non-words and rule-words after exposure to a segmented stream. We suggested that this behavior could be due to infants’ sensitivity to word-internal structure. However, two other interpretations were possible. First, -month-olds might have responded to the relation between syllables and pauses, rather than to word-internal structure. In a segmented stream, words are preceded and followed by pauses. Thus, pauses could have created an ‘edge effect’ during familiarization, making both edge positions particularly salient (Endress, Scholl & Mehler, ; Endress & Mehler, ). Indeed, edges of words and syntactic constituents are particularly important in natural language. For example, word stress is always defined relative to word edges (e.g. word-initial, or word-final). Likewise, affixation most often occurs at word edges. Or, syntactic and phonological phrases are always aligned at least in one edge (Nespor & Vogel, ; see Endress, Nespor & Mehler, , for an overview). Therefore, sensitivity to edges may be a plausible determinant of the -month-olds’ results. This interpretation raises the possibility that infants could have responded on the basis of primitive perceptual mechanisms not related to morphological structure. Together, Experiments  and  render this explanation implausible. Such primitive mechanisms are present even before  months (Gervain, Macagno, Cogoi, Peña & Mehler, b). If infants only monitored the syllables preceded or followed by pauses, then, after being familiarized to a segmented stream, -month-olds should have succeeded both when comparing rule-words against non-words and when comparing words against non-words because both contrasts are identical in this respect. Yet they succeed in the latter, but not in the former, contrast. Thus, edge effects are not sufficient to explain the results. By a similar argument, a second alternative interpretation of the -month-olds’ behaviors now appears implausible, if not impossible. The interpretation held that -month-olds responded differently to rule-words and non-words in Experiment  not because they captured the words’ internal structure but because they monitored positional differences between familiarization words and the test items. However, although to our knowledge the issue has not been directly studied, several studies indicate that infants can monitor positional variations even at  months. For example, Marcus et al. () showed that -month-olds detect a difference between AAB and ABA or ABB and ABA patterns, and this implies that infants can detect positional variations at an even more abstract level. Furthermore, in a paradigm more closely resembling the current experiments, Gervain et al. (a) showed that, after exposure to a continuous stream of artificial syllables containing frequent and infrequent items, -month-olds (but not -month-olds; see Yoshida et al., ) are already sensitive to the frequency and position of the syllables because they can match the frequent–infrequent pattern of a polysyllabic element 

MARCHETTO AND BONATTI

with the frequent/infrequent order of syntactic elements in their native language. To succeed in discriminating between test items, infants in our studies must be sensitive to positional changes of even one single position in a string. Yet -month-olds failed to differentiate rule-words and non-words even after a segmented familiarization. This failure shows that success in our task does not come just from tracking syllable positional information in words. The conclusion holds for -month-olds and, a fortiori, for -month-olds. In conclusion, the pattern of results we found with -month-olds is unlikely to be explained by some shallow properties of the test items, such as a simple violation of positional information, and strongly suggests the same conclusion for the results with -month-olds. The infants’ reactions appear to be driven by a deeper level of analysis of the stimuli heard in familiarization – a level that -month-olds, but not -month-olds, are able to master.

GENERAL DISCUSSION

To acquire a language, a learner must build a lexicon and find the productive rules generating an infinite set of novel sentences. However, the problems of finding the words of a language and that of finding the words’ structure are not independent. Most rules are hidden within words, as the widespread presence of morphosyntactic rules in natural languages attests. How do infants solve the simultaneous problems of finding words and structure within words? In this paper, we began probing the origins of morphosyntactic abilities by studying how infants acquire elementary artificial languages. While they are a far cry from real natural languages, artificial languages share commonalities with real natural language stimuli. They activate broadly overlapping brain circuits, they mimic the processes involved in the acquisition of structural elements of a natural language, and they produce representations that carry some of the crucial features that the representations attached to words have in a natural language, such as being content-carrying sounds. Thus, it seemed reasonable to us to assume that, to some extent, the sensitivity to the internal structure of artificially induced ‘words’ may reveal abilities of morphological analysis that can be precursors or direct components of a nascent sensitivity to the morphosyntax of natural language words. In a series of artificial language experiments, we tried to reproduce word learning and morphosyntactic learning in the laboratory. We induced infants to segment an artificial stream into trisyllabic elements that could be possible word candidates. We then asked when and under what conditions infants begin finding a structure within such trisyllabic elements. 

FINDING WORDS AND WORD STRUCTURE

There are many possible strategies that preverbal infants could deploy. At the extremes, one conservative strategy is to wait and see: keep collecting data (in this case, words) until you can extract the correct rules of composition by cross-tabulating variations among words. At the other extreme, a more aggressive strategy is to boldly project structure (Newport, ; Bonatti, ): as soon as you have a few words in your lexicon, try to find out how they are built. The former strategy would predict a late, or very late, onset of morphosyntactic abilities. The latter strategy would predict that infants try to look at the structure within words as soon as they begin converging towards their native language – perhaps not earlier, but not much later. Given their limited cognitive resources, the aggressive strategy could pay off better for infants. Instead of recording every word occurrence with its context, an efficient, but limited, system would benefit from projecting structure before the vocabulary size becomes unmanageable. Our main findings can be summarized as follows. (i) At  months, infants can capture the word-internal structure of an artificial language after exposure to a segmented speech stream (Experiment ), without the need of extensive exposure or the need to perceive extensive variability in the token words. In contrast, -month-olds fail the same task (Experiment ), despite the fact that they can extract the words after exposure to the same stream (Experiment ). (ii) At  months, infants can extract words from a continuous speech stream, by computing non-adjacent probabilities or less than perfect adjacent TPs (Experiment ). In contrast, -month-old infants fail to exploit the same statistical cues (Experiment ) and seem to rely predominately on clear segmentation cues to identify units in an artificial speech stream (Experiment ). (iii) At  months, infants fail to grasp the internal structure of words after exposure to a continuous speech stream (Experiment ), although they can identify these words after exposure to the same stream (Experiment ) and could, in principle, use the same distributional properties that allow them to find words in order to extract word structure. We discuss these results in turn.

Finding structure in the words of an artificial language at  months We showed that -month-old infants briefly exposed to segmented, but not to continuous, streams could identify word-internal structural properties. The perception of word-internal relations does not need extensive familiarization or extensive variability of the middle element, suggesting that the tendency to find structure over-regularizing from an 

MARCHETTO AND BONATTI

impoverished input – a phenomenon documented in older children (Kam & Newport, , ) – may already be present at the end of the first year. Although further research is needed to establish the exact computation that leads infants to succeed in our tasks, our results show that, at  months, infants can delve into the internal composition of words in an artificial language, looking for structural information starting from a scant and novel potential lexicon. In contrast, -month-olds showed no sensitivity to word structure. At that age, infants possess the ability to grasp structural information inside clearly separated sequences of sounds (Marcus et al., ). These abilities do not seem to translate into the ability to analyze within-word relations. This failure to find word structure, despite a successful identification of the words contained in the stream, suggests that specific operations of proto-morphosyntactic analysis may be absent. We proposed to locate the onset of morphosyntax at a developmental phase in which proper representations of a natural language lexicon can be constructed. At  months, infants could not even construct complete representations of natural language words, if nothing else because the process of phonological narrowing has not been completed. It may even be detrimental to the development of a functional morphosyntax to project within-word productive rules at this stage. When infants can represent sequences of sounds as potential candidates for word-ness in their language, then they may begin building a proper lexicon, and they may also become interested in delving into the internal composition of their nascent lexicon. Our data are compatible with this hypothesis. In the following sections, we explore other aspects of this potential interpretation. The role and time course of TP computations in finding words and word structure Many studies have documented remarkable statistical abilities in very young infants, but the exact power of such abilities is still relatively unknown. In our studies, -month-olds could find words inside an artificial continuous stream. They may have performed so by computing high non-adjacent TPs or by distinguishing less-than-perfect adjacent TPs from sequences without statistical coherence. Whatever the exact computation, we documented that, at this age, infants possess more powerful statistical abilities than previously known. Unlike -month-olds, -month-olds failed to find words after exposure to the same continuous stream. This failure is surprising. At this age, infants can compute adjacent TPs under many different conditions. To distinguish words from non-words in our experiments, it is sufficient to detect the difference between adjacent syllable transitions with · probability and the -probability transitions characterizing non-words. Our findings suggest that -month-olds can segment syllable groups out of a continuum only when one of the contrasting classes is identified by an extremely clear signal, 

FINDING WORDS AND WORD STRUCTURE

such as that communicated by perfect TPs. When the signal is degraded, the power of statistical computations that infants of that age possess is not sufficient to solve the problem of word segmentation. When they identify words in natural language, as they do (e.g. Jusczyk & Aslin, ; Shi et al., ; Bergelson & Swingley, ), they may exploit other properties of the signal, such as prosody (Shukla, White & Aslin, ), that do not reduce to TP computations. The words in our experiments are also identifiable by perfect non-adjacent TPs between their first and last syllables. Thus, the failure in -month-olds may also be interpreted as suggesting a lack of sensitivity to non-adjacent relations. When non-adjacency is realized as identical syllable repetition (e.g. pu-ne-pu), infants do succeed in identifying relations at a distance (Gervain and Werker, b). Perhaps the support of basic, most likely non-linguistic, mechanisms such as repetition detectors (Gervain et al., b; Endress & Mehler, ) may boost a still very primitive ability. Under either interpretation, our data show that statistical abilities in -month-olds are quite frail, and very far from dealing with TP values present in real natural language samples (Yang, ). Other evidence seems to lead to this conclusion. For example, Johnson and Tyler () showed that by simply using words of different lengths in an artificial language, even -month-olds failed to use TPs to segment words from a continuous stream, unless the stream contained other cues, such as familiar words in the infant’s natural language, which may help them to tune their computations to its relevant parts (Mersad & Nazzi, ). Arguably, words of different lengths are very common in natural speech, suggesting that TP computations may have a limited role in more natural settings than those explored until now. The exact strength or weakness of TPs in real word segmentation is a topic that requires further research.

Segmentation, generalization, and the properties of the speech signal Perhaps the most surprising finding in our experiments is that -month-olds could find words after exposure to a continuous stream, but showed no sensitivity to word-internal structure after listening to the same stream. At the same time, they could extract such information after being exposed to a segmented stream. Remarkably, the results of the -month-olds closely resemble those of Peña et al. (): adults could find words in a continuous speech stream, they showed no sign of sensitivity to non-adjacent word structure, but they could easily grasp the morphological structure when the words were separated by even minimal segmentation cues (Peña et al., ; Endress & Bonatti, ). Endress and Bonatti () proposed that this dissociation signals the presence of two mechanisms analyzing a speech stream. One mechanism, relying on 

MARCHETTO AND BONATTI

general-purpose statistical computations, breaks the continuum into its components. The other mechanism, more akin to the projection of general structures from exemplars than to statistical data gathering (Bonatti, ), is responsible for structure extraction. This second mechanism can act upon previously segmented material and can quickly extract generalizations among the segmented components, even when they are realized among non-adjacent elements. Recent neurophysiological evidence suggests that different brain circuits support the initial process of identifying words and the detection of word structure after exposure to streams very similar to those used by Peña et al. () (De Diego-Balaguer et al., ; De Diego-Balaguer, Fuentemilla & Rodriguez-Fornells, ). These results suggest that the construction of a lexical database in the process of natural language acquisition and the identification of its morphosyntactic rules may be due to different processes, potentially dissociable in their brain representation. The similarity between infants and adults exposed to comparable conditions may suggest that similar mechanisms are active in the infant brain, characterized by different developmental timecourses. However, by design, our experiments cannot directly inform us about the mechanisms underlying infants’ abilities. Alternative explanations of -month-olds’ success at differentiating rule-words and non-words after exposure to a segmented stream (appealing, for example, to merely the position of syllables within words, or of word edges) are possible. However, our data show that it is not easy to elaborate them. While our results leave many issues wide open, they suggest that learning words and discovering word-internal structures are distinct tasks and present different levels of complexity. These abilities seem to appear at different moments in early development. Infants can compute both words and word structure from a speech stream (Saffran & Wilson, ), but in different ways and at different ages. Our experiments document how quickly the focus of language learning switches from the identification of chunks of sound patterns to the organization of lexical knowledge at a deeper level of abstraction. A few months after infants can segment a continuum, they can already see words of an artificial language as more than simple unanalyzed syllable groups. This suggests that, at the end of their first year, when the needs to break into one’s own native language, to learn it and eventually to produce it, become pressing, infants may already code words as entities structured by internal rules of generation that they quickly set out to discover, thus building the morphological structure together with their lexicon.



FINDING WORDS AND WORD STRUCTURE SUPPLEMENTARY MATERIALS

The supplementary material referred to in this paper can be found online at .

REFERENCES Aronoff, M. & Fudeman, K. A. (). What is morphology, nd ed. Chichester & Malden, MA: Wiley-Blackwell. Aslin, R. N., Saffran, J. R. & Newport, E. L. (). Computation of conditional probability statistics by -month-old infants. Psychological Science (), –. Bergelson, E. & Swingley, D. (). At – months, human infants know the meanings of many common nouns. PNAS Proceedings of the National Academy of Sciences of the United States of America (), –. Bisetto, A. & Melloni, C. (). Parsynthetic compounding. Lingue e Linguaggio VII(), –. Bonatti, L. L. (). On pigeons, humans, language and the mind. In P. Carruthers, S. Laurence & S. Stich (eds), The innate mind: foundations and future, –. Oxford: Oxford University Press. Bonatti, L. L., Peña, M., Nespor, M. & Mehler, J. (). Linguistic constraints on statistical computations: the role of consonants and vowels in continuous speech processing. Psychological Science (), –. Brent, M. R. & Siskind, J. M. (). The role of exposure to isolated words in early vocabulary development. Cognition (), B−. Caprin, C. & Guasti, M. T. (). The acquisition of morphosyntax in Italian: a cross-sectional study. Applied Psycholinguistics (), –. Cohen, J. D., MacWhinney, B., Flatt, M. & Provost, J. (). PsyScope: an interactive graphic system for designing and controlling experiments in the psychology laboratory using Macintosh computers. Behavior Research Methods, Instruments & Computers , –. De Diego-Balaguer, R., Fuentemilla, L. & Rodriguez-Fornells, A. (). Brain dynamics sustaining rapid rule extraction from speech. Journal of Cognitive Neuroscience (), –. de Diego-Balaguer, R. & Lopez-Barroso, D. (). Cognitive and neural mechanisms sustaining rule learning from speech. Language Learning (Supplement ), –. De Diego-Balaguer, R., Toro, J. M., Rodriguez-Fornells, A. & Bachoud-Levi, A. C. (). Different neurophysiological mechanisms underlying word and rule extraction from speech. PLoS ONE (), e. Dutoit, T., Pagel, V., Bataille, F. & Vreken, O. (). The MBROLA project: towards a set of high-quality speech synthesizers free of use for non-commercial purposes. In Proceedings of the Fourth International Conference on Spoken Language Processing, Philadelphia. Endress, A. & Bonatti, L. L. (). Rapid learning of syllable classes from a perceptually continuous speech stream. Cognition (), –. Endress, A. D. & Mehler, J. (). Primitive computations in speech processing. Quarterly Journal of Experimental Psychology (), –. Endress, A. D., Nespor, M. & Mehler, J. (). Perceptual and memory constraints on language acquisition. Trends in Cognitive Sciences (), –. Endress, A. D., Scholl, B. J. & Mehler, J. (). The role of salience in the extraction of algebraic rules. Journal of Experimental Psychology: General (), –. Gertner, Y. & Fisher, C. (). Predicted errors in children’s early sentence comprehension. Cognition (), –. Gervain, J., Macagno, F., Cogoi, S., Peña, M. & Mehler, J. (b). The neonate brain detects speech structure. PNAS Proceedings of the National Academy of Sciences of the United States of America (), –.



MARCHETTO AND BONATTI

Gervain, J., Nespor, M., Mazuka, R., Horie, R. & Mehler, J. (a). Bootstrapping word order in prelexical infants: a Japanese−Italian cross-linguistic study. Cognitive Psychology (), –. Gervain, J., Sebastián-Gallés, N., Díaz, B., Laka, I., Mazuka, R., Yamane, N. N., Nespor, M., & Mehler, J. (). Word frequency cues word order in adults: cross-linguistic evidence. Frontiers in Psychology , –. Gervain, J. & Werker, J. F. (a). Prosody cues word order in -month-old bilingual infants. Nature Communications (), –. Gervain, J. & Werker, J. F. (b). Learning non-adjacent regularities at age ;. Journal of Child Language (), –. Gómez, R. L. (). Variability and detection of invariant structure. Psychological Science (), –. Gómez, R. L. & Gerken, L. (). Artificial grammar learning by -year-olds leads to specific and abstract knowledge. Cognition (), –. Gómez, R. L. & Lakusta, L. (). A first step in form-based category abstraction by -month-old infants. Developmental Science (), –. Gómez, R. L. & Maye, J. (). The developmental trajectory of nonadjacent dependency learning. Infancy (), –. Gonzalez-Gomez, N. & Nazzi, T. (). Phonotactic acquisition in healthy preterm infants. Developmental Science (), –. Gonzalez-Gomez, N., Poltrock, S. & Nazzi, T. (). A ‘bat’ is easier to learn than a ‘tab’: effects of relative phonotactic frequency on infant word learning. PLoS ONE (), e. Guasti, M. T. (). Verb syntax in Italian child grammar: finite and nonfinite verbs. Language Acquisition: A Journal of Developmental Linguistics (), –. Guasti, M. T. (). Language acquisition: the growth of grammar. Cambridge, MA: MIT Press. Halberda, J. (). The development of a word-learning strategy. Cognition (), B−. Hirsh-Pasek, K. & Golinkoff, R. M. (). The origins of grammar: evidence from early language comprehension. Cambridge, MA: MIT Press. Höhle, B., Weissenborn, J., Kiefer, D., Schulz, A. & Schmitz, M. (). Functional elements in infants’ speech processing: the role of determiners in the syntactic categorization of lexical elements. Infancy (), –. Johnson, E. K., Jusczyk, P. W., Cutler, A. & Norris, D. (). Lexical viability constraints on speech segmentation by infants. Cognitive Psychology (), –. Johnson, E. K. & Tyler, M. D. (). Testing the limits of statistical learning for word segmentation. Developmental Science (), –. Jusczyk, P. W. & Aslin, R. N. (). Infants’ detection of the sound patterns of words in fluent speech. Cognitive Psychology (), –. Kam, C. L. H. & Newport, E. L. (). Regularizing unpredictable variation: the roles of adult and child learners in language formation and change. Language Learning and Development (), –. Kam, C. L. H. & Newport, E. L. (). Getting it right by getting it wrong: when learners change languages. Cognitive Psychology (), –. Langus, A., Marchetto, E., Bion, R. A. H. & Nespor, M. (). Can prosody be used to discover hierarchical structure in continuous speech? Journal of Memory and Language (), –. Lany, J. & Saffran, J. R. (). From statistics to meaning: infants’ acquisition of lexical categories. Psychological Science (), –. Lavale Ortiz, R. M. (). Sensory-emotional denominal causative verbs. In J. L. C. Honrubia & S. R. Rosique (eds), Spanish word formation and lexical creation (IVITRA Research in Linguistics and Literature), –. Amsterdam & Philadelphia: John Benjamins Publishing Company. Lew-Williams, C., Pelucchi, B. & Saffran, J. R. (). Isolated words enhance statistical language learning in infancy. Developmental Science (), –.



FINDING WORDS AND WORD STRUCTURE

López-Barroso, D., Catani, M., Ripollés, P., Dell’Acqua, F., Rodríguez-Fornells, A. & de Diego-Balaguer, R. (). Word learning is mediated by the left arcuate fasciculus. PNAS Proceedings of the National Academy of Sciences of the United States of America (), –. Lopez-Barroso, D., de Diego-Balaguer, R., Cunillera, T., Camara, E., Münte, T. F. & Rodriguez-Fornells, A. (). Language learning under working memory constraints correlates with microstructural differences in the ventral language pathway. Cerebral Cortex (), –. Marchetto, E. & Bonatti, L. L. (). Words and possible words in early language acquisition. Cognitive Psychology (), –. Marcus, G. F., Vijayan, S., Rao, S. B. & Vishton, P. M. (). Rule learning by seven-month-old infants. Science (), –. Mattys, S. L., Jusczyk, P. W., Luce, P. A. & Morgan, J. L. (). Phonotactic and prosodic effects on word segmentation in infants. Cognitive Psychology (), –. McNealy, K., Mazziotta, J. C. & Dapretto, M. (). Cracking the language code: neural mechanisms underlying speech parsing. Journal of Neuroscience (), –. McNealy, K., Mazziotta, J. C. & Dapretto, M. (). The neural basis of speech parsing in children and adults. Developmental Science (), –. Mersad, K. & Nazzi, T. (). When Mommy comes to the rescue of statistics: infants combine top-down and bottom-up cues to segment speech. Language Learning and Development (), –. Mintz, T. H. (). The segmentation of sub-lexical morphemes in English-learning -month-olds. Frontiers in Psychology , –. Nespor, M. & Vogel, I. (). Prosodic phonology. Dordrecht: Foris. Newport, E. L. (). Maturational constraints on language learning. Cognitive Science (), –. Onnis, L., Monaghan, P., Richmond, K. & Chater, N. (). Phonology impacts segmentation in online speech processing. Journal of Memory and Language (), –. Payne, T. E. (). Describing morphosyntax: a guide for field linguists. Cambridge & New York, NY: Cambridge University Press. Pelucchi, B., Hay, J. F. & Saffran, J. R. (). Statistical learning in a natural language by -month-old infants. Child Development (), –. Peña, M., Bonatti, L. L., Nespor, M. & Mehler, J. (). Signal-driven computations in speech processing. Science (), –. Pizzuto, E. & Caselli, M. C. (). The acquisition of Italian morphology: implications for models of language development. Journal of Child Language (), –. Saffran, J. R., Aslin, R. N. & Newport, E. L. (). Statistical learning by -month-old infants. Science (), –. Saffran, J. R., Newport, E. L. & Aslin, R. N. (). Word segmentation: the role of distributional cues. Journal of Memory and Language (), –. Saffran, J. R. & Wilson, D. P. (). From syllables to syntax: multilevel statistical learning by -month-old infants. Infancy (), –. Santelmann, L. M. & Jusczyk, P. W. (). Sensitivity to discontinuous dependencies in language learners: evidence for limitations in processing space. Cognition (), –. Shi, R. & Lepage, M. (). The effect of functional morphemes on word segmentation in preverbal infants. Developmental Science (), –. Shi, R., Werker, J. F. & Cutler, A. (). Recognition and representation of function words in English-learning infants. Infancy (), –. Shukla, M., Nespor, M. & Mehler, J. (). An interaction between prosody and statistics in the segmentation of fluent speech. Cognitive Psychology (), –. Shukla, M., White, K. S. & Aslin, R. N. (). Prosody guides the rapid mapping of auditory word forms onto visual objects in -month-old infants. PNAS Proceedings of the National Academy of Sciences of the United States of America (), –.



MARCHETTO AND BONATTI

Slaughter, V. & Suddendorf, T. (). Participant loss due to ‘fussiness’ in infant visual paradigms: a review of the last  years. Infant Behavior and Development (), –. Stager, C. L. & Werker, J. F. (). Infants listen for more phonetic detail in speech perception than in word-learning tasks. Nature , –. Thiessen, E. D. & Erickson, L. C. (). Discovering words in fluent speech: the contribution of two kinds of statistical information. Frontiers in Psychology , –. Toro, J. M., Nespor, M., Mehler, J. & Bonatti, L. L. (). Finding words and rules in a speech stream: functional differences between vowels and consonants. Psychological Science (), –. Torrens, V. (). The acquisition of inflection in Spanish and Catalan. In C. T. Schütze, J. B. Ganger & K. J. Broihier (eds), Papers on language processing and acquisition. Cambridge, MA: MiTWPL, Department of Linguistics, MIT. Vilares, J., Cabrero, D. & Alonso, M. (). Applying productive derivational morphology to term indexing of Spanish texts. In Computational Linguistics and Intelligent Text Processing , –. Berlin & Heidelberg: Springer. Werker, J. F. & Tees, R. C. (). Developmental changes across childhood in the perception of non-native speech sounds. Canadian Journal of Psychology (), –. Werker, J. F. & Tees, R. C. (). Cross-language speech perception: evidence for perceptual reorganization during the first year of life. Infant Behavior and Development (), –. Yang, C. D. (). Universal Grammar, statistics or both? Trends in Cognitive Sciences (), –. Yoshida, K. A., Iversen, J. R., Patel, A. D., Mazuka, R., Nito, H., Gervain, J. J., & Werker, J. F. (). The development of perceptual grouping biases in infancy: a Japanese−English cross-linguistic study. Cognition (), –.



Finding words and word structure in artificial speech: the development of infants' sensitivity to morphosyntactic regularities.

To achieve language proficiency, infants must find the building blocks of speech and master the rules governing their legal combinations. However, the...
457KB Sizes 0 Downloads 6 Views