How are things adding up? Neural differences between arithmetic operations are due to general problem solving strategies.

NeuroImage 92 (2014) 369–380

Contents lists available at ScienceDirect

NeuroImage journal homepage: www.elsevier.com/locate/ynimg

How are things adding up? Neural differences between arithmetic operations are due to general problem solving strategies Nadja Tschentscher ⁎, Olaf Hauk Medical Research Council, Cognition and Brain Sciences Unit, 15 Chaucer Road, Cambridge CB2 7EF, UK

a r t i c l e

i n f o

Article history: Accepted 26 January 2014 Available online 10 February 2014 Keywords: fMRI Arithmetic Strategy Problem solving Embodied cognition

a b s t r a c t A number of previous studies have interpreted differences in brain activation between arithmetic operation types (e.g. addition and multiplication) as evidence in favor of distinct cortical representations, processes or neural systems. It is still not clear how differences in general task complexity contribute to these neural differences. Here, we used a mental arithmetic paradigm to disentangle brain areas related to general problem solving from those involved in operation type specific processes (addition versus multiplication). We orthogonally varied operation type and complexity. Importantly, complexity was defined not only based on surface criteria (for example number size), but also on the basis of individual participants' strategy ratings, which were validated in a detailed behavioral analysis. We replicated previously reported operation type effects in our analyses based on surface criteria. However, these effects vanished when controlling for individual strategies. Instead, procedural strategies contrasted with memory retrieval reliably activated fronto-parietal and motor regions, while retrieval strategies activated parietal cortices. This challenges views that operation types rely on partially different neural systems, and suggests that previously reported differences between operation types may have emerged due to invalid measures of complexity. We conclude that mental arithmetic is a powerful paradigm to study brain networks of abstract problem solving, as long as individual participants' strategies are taken into account. © 2014 Elsevier Inc. All rights reserved.

Introduction Mental arithmetic is a highly over-learned skill which can nevertheless require considerable mental effort. Hence, it provides an excellent framework for the investigation of cognitive processes underlying abstract problem solving in a well-controlled setting. Several authors have already emphasized the role of executive functions, verbal processes, and sensory–motor derived concepts for arithmetic problem solving (e.g. Anderson et al., 2011; Arsalidou and Taylor, 2011). In previous research, neural differences in brain activation between arithmetic operation types (e.g. addition and multiplication) have been interpreted as evidence that these operations rely on distinct neural representations, e.g. within language or sensory–motor systems. With respect to the involvement of sensory–motor systems in mental arithmetic, it has been suggested that arithmetic problem solving, and numerical cognition in general, may be embodied, i.e. may rely on our sensory–motor experiences within the environment (Fischer, 2012; Lakoff and Núñez, 2000). This might be reflected in associations of numbers or specific arithmetic tasks with finger-counting patterns, or with movement along a mental number line (cf. Andres et al., 2012; Klein et al., 2011; Knops et al., 2009a,b; Tschentscher et al., 2012). Further, it has been proposed that evolutionary older brain

⁎ Corresponding author. E-mail address: [email protected] (N. Tschentscher). 1053-8119/$ – see front matter © 2014 Elsevier Inc. All rights reserved. http://dx.doi.org/10.1016/j.neuroimage.2014.01.061

circuits of magnitude processing are “recycled” for more recent culturally acquired cognitive functions, such as symbolic arithmetic (Dehaene and Cohen, 2007). The degree to which specific arithmetic operations require these evolutionary older brain systems may depend on their similarity with the cognitive processes these systems support. It has been proposed that the degree of similarity between arithmetic and the cognitive processes, which are supported by evolutionary older systems, might vary across types of basic arithmetic operations (Prado et al., 2011). This “cultural recycling” theory provides an evolutionary underpinning for embodied theories of cognition. Empirical evidence on content-specific neural systems Previous behavioral and neuroimaging evidence has been interpreted in favor of the embodiment hypothesis (for review, see Hauk and Tschentscher, 2013). Several authors have suggested that sensory– motor knowledge (Badets et al., 2010; Klein et al., 2011) and spatialattention processes (Knops et al., 2009a,b; Pinhas and Fischer, 2008) are involved in addition and subtraction tasks, while multiplication has been more strongly associated with left-lateralized language networks (Andres et al., 2010; Chochon et al., 1999; Grabner et al., 2009a; Lee and Kang, 2002; Zhou et al., 2006b). Evidence has been provided for shared neural resources of simple mental subtraction and finger discrimination (Andres et al., 2012), in line with the impact of individual finger-counting habits on the cortical representation of numbers (Tschentscher et al., 2012).

370

N. Tschentscher, O. Hauk / NeuroImage 92 (2014) 369–380

Furthermore, the specific involvement of visual–spatial processes in subtraction and addition has been shown by the “Operational Momentum” effect for single-digit numbers and non-symbolic numerals (Knops et al., 2009b; Pinhas & Fischer, 2008). This is supported by results of multi-voxel pattern analyses (MVPA), revealing different neural activation patterns for simple addition and subtraction in posterior superior parietal lobule (PSPL), an area that is involved in eye-movements and spatial attention (Knops et al., 2009a). Conversely, several studies have reported stronger left-lateralized activation for multiplication, in favor of specific language-based processing (Prado et al., 2011). Direct comparison of addition and multiplication tasks revealed more activation in left-hemispheric premotor and supplementary motor regions for multiplication, as well as in posterior and anterior superior temporal gyrus (cf. Chochon et al., 1999; Zhou et al., 2006b). An effect of TMS in left parietal regions was reported in multiplication tasks (Andres et al., 2010), and stronger activation for multiplication than addition tasks was found for language-associated left angular gyrus (AG) regions independently from task-complexity effects (Grabner et al., 2009a). However, the specific role of languagebased fact retrieval for multiplication has been challenged by recent fMRI evidence, reporting operation type effects in the right hemispheric posterior intraparietal regions as opposed to the left-hemispheric AG (Rosenberg-Lee et al., 2011). Finally, a recent meta-analysis on fMRI activations for arithmetic operations contrasted against a control task reported distinct prefrontal and parietal effects for types of basic arithmetic operations (Arsalidou & Taylor, 2011). Content-specific arithmetic effects — a matter of general task complexity? Several factors may have confounded previous interpretations of neural operation type effects as reported in previous neuroimaging studies. While the differential involvement of visual–spatial, sensory– motor, and verbal processes has been suggested for addition, subtraction and multiplication, the neuroscientific investigation of procedures and strategies underlying arithmetic problem solving has received relatively less attention so far. It is still an open question whether observed neural differences between basic arithmetic operation types reflect “true” differences in operation-specific representations or processes (e.g. whether retrieving the solution of a simple addition problem such as “2 + 3” really requires qualitatively different processes than retrieving a simple multiplication problem such as “2 ∗ 3”), or whether they are due to mislabelling of arithmetic problems into “easy” and “complex” based on surface criteria (e.g. some participants may involve counting strategies to solve “4 + 3”, but retrieve “4 ∗ 3” from memory). Although arithmetic operation type effects have previously been linked to the use of differential problem solving strategies in fMRI studies (cf. Grabner et al., 2009a; Rosenberg-Lee et al., 2011), and the idea that the neural circuits involved in mental arithmetic may determine the problem solving strategy has already been mentioned in the context of the Triple-Code Model (Dehaene, 1992; Dehaene et al., 2003), those strategies have not been investigated extensively in neuroimaging research. Instead, most previous fMRI studies have matched the complexity between different operation types only on the basis of surface criteria (e.g. the sum of the operands of problems such as “3 + 4” or “12 + 37”), but did not take into account individual differences in arithmetic strategies. Hence, reported operation type effects of previous neuroimaging studies need to be interpreted with caution. Some of these studies show clear evidence for a mismatch of general task complexity across operation types in behavioral measures of accuracy and reaction times (Chochon et al., 1999; Rosenberg-Lee et al., 2011; Zhou et al., 2006a), while other studies do not report behavioral measures for different operation types (Grabner et al., 2009a). This suggests that operation type specific effects may have been confounded with effects of general task complexity. Most studies contrasted tasks with numbers smaller versus larger than five (cf. Jost et al., 2004, 2009), tasks where the sum of numbers

was smaller versus larger than 25 (cf. Grabner et al., 2009a), tasks with or without carrying versus borrowing (Kong et al., 2005), or presented two-operand tasks versus three-operand tasks (Menon et al., 2000). However, complexity might systematically vary as a function of operation type and individual skills, which determine the applied arithmetic strategy (Grabner et al., 2007; LeFevre et al., 1996a, 2006). According to our knowledge, only one fMRI study has addressed strategy-use when matching difficulty across operation types (Grabner et al., 2009a), while focusing on angular gyrus' role in arithmetic factretrieval. However, complex arithmetic does not only involve fact retrieval, but also procedural knowledge, sequencing of operations, and working memory. Hence, arithmetic operation types may differ with respect to a number of variables that are not related to spatial, verbal or motor dimensions, but rather procedural features. Behavioral studies have shown that even “simple” problems (e.g. “3 + 4”), often assumed to be retrieved from memory, may invoke procedural strategies such as counting (cf. LeFevre et al., 1996a,b). If such strategies differ between operation types in a given experimental design, a careful analysis of procedural complexity is necessary, before conclusions from neural differences between them can be drawn. Evidence for the neural dissociation of differential arithmetic strategies comes from neurophysiologic studies. For example, the impairment of arithmetic fact retrieval from memory has been reported after left parietal and left subcortical lesions irrespectively of the type of arithmetic operation (Dehaene and Cohen, 1997; Warrington, 1982). A right intraparietal lesion caused impairments in quantity processing of numbers while the knowledge about arithmetic facts was intact (Dehaene & Cohen, 1997), and frontal lesions affected complex problems requiring multi-step arithmetic strategies (cf. Luria, 1966). In the Triple-Code Model Dehaene and Cohen (1995, 1997), suggest that a verbally mediated network of left perisylvian areas and left angular gyrus supports the retrieval of simple arithmetic facts. The processing of more complex arithmetic, for which direct retrieval of answers from memory is impossible, additionally requires procedural numbermanipulation strategies involving visual–spatial processes supported by bilateral posterior parietal lobule, and numerical quantity processing, associated with bilateral intraparietal sulcus. The predictions of the Triple-Code Model may be in line with a recent fiber tracking study, suggesting a predominance of ventral fiber tracks between left-hemispheric frontal and parietal regions for easy arithmetic, and dorsal as well as ventral streams for complex arithmetic tasks (Klein et al., 2013). One may interpret this result in favor of distinct neural networks for fact retrieval and procedural arithmetic strategies. Furthermore, Dehaene and colleagues also claim that fact retrieval and procedural strategies might be differentially relevant for different types of arithmetic operations, as suggested by specific deficits in two patients with a left subcortical lesion and right inferior parietal lesion respectively (Dehaene & Cohen, 1997). Operation type specific deficits have been also reported by other neurophysiologic studies (McCloskey et al., 1985). However, as Dehaene and colleagues point out, it might be the case that observed deficits for particular operation types rather reflected specific deficits for particular strategies that might have been more or less used as a function of experienced task complexity. Hence, in our view, the investigation of arithmetic strategies within neural networks of general problem solving is essential for answering questions concerning differences between operation types. Orthogonal assessment of arithmetic operation type and arithmetic strategy We conclude that it is still an open question as to what extent previously reported neural operation type effects might have been confounded with general aspects of task complexity. It is therefore important to analyze the strategies used for “simple” and “complex” arithmetic problems, and more importantly, how they differ for individuals and operation types. We here investigated this issue in behavioral and fMRI data, orthogonally varying operation type (addition and multiplication) and arithmetic strategy (multi-step procedural strategy versus memory


retrieval), as defined by individual strategy ratings on a trial-by-trial basis. Analyses with task conditions defined based on strategy ratings were compared with “classical” definitions of task complexity based on number size. In EEG time–frequency analyses, it has been suggested that self-report based strategy assessment is a more sensitive measure for the evaluation of complexity than number size based definition of task complexity (Grabner and De Smedt, 2011). We assessed the validity of our self-reports, relative to complexity definitions based on number size, by modeling reaction time distributions using ex-Gaussian functions (cf. LeFevre et al., 2006; Matzke and Wagenmakers, 2009), as well as by assessing the Number Size Effect in different conditions in a linear regression analysis (LeFevre et al., 1996b). Our study design allows the investigation of operation-specific effects while controlling for individual problem solving strategies, thus disentangling neural differences due to arithmetic operation types (addition versus multiplication) from those of general problem solving strategies (multi-step procedural strategy versus memory retrieval). We analyzed sensory–motor regions, based on a finger-localizer, parietal areas of numerical representations (Dehaene et al., 2003), as well as neural networks involved in general problem solving. For this, regions involved in general executive functions were selected based on the multiple-demand (MD) network of human intelligent behavior (Duncan, 2010). A meta-analysis on simple and complex mental calculation problems revealed activation in many of these multiple-demand regions (Arsalidou and Taylor, 2011). Material and methods Participants Data of 26 participants (13 males; 13 females) entered the final analysis. Data from two participants had previously been excluded because of unacceptable head movements within the scanning sessions. All participants were right-handed, had normal or corrected-to-normal vision, were educated in Western cultures (e.g. USA or Europe), and had no history of neurological or psychiatric disorder. Participants were pre-tested on mental calculation skills with a standard email-questionnaire. Only those subjects were selected based on the questionnaire, who indicated to solve the type of tasks employed in our study within 4 s. This procedure was chosen to ensure that selected participants were able to solve the majority of presented tasks in the experiment, and to exclude participants which might suffer from specific (undiagnosed) problems with mental calculation. Participants' IQ was assessed by using the CultureFair-Test, Scale 2 (Cattell and Cattell, 1960). The mean IQ of all participants was 130 (SD 18.5) (mean IQ of females 132.38 (SD 20.73); mean IQ of males 128.07 (SD 16.49)). Handedness was confirmed by a ten-item version of the Edinburgh Handedness Inventory (mean Laterality Quotient: 89; SD: 20) (Oldfield, 1971). Participants received about £40 for their participation and ethical approval was obtained from the Cambridge Local Research Ethics Committee. Stimuli and procedure 60 trials were presented in each of the four conditions (addition and multiplication tasks with two pre-defined levels of complexity each). The complex condition contained the combination of numbers 12–59 for addition tasks and the numbers 2–5 with 12–29 for multiplication tasks. The easy condition consisted of two 1-digit numbers. Those conditions were initially defined based on number size and number of operands, as well as reaction times obtained in two pilot-studies with different participants (N = 13 and N = 9 participants), which revealed no significant behavioral differences between operation types on both complexity levels. Surface features of stimuli were carefully matched: an equal amount of problems containing two even numbers, two odd numbers, as well as odd/even and even/odd number-combinations were chosen. Due to the lower number of available operands in the

371

easy conditions, tasks were presented twice but with reversed order of operands, and a small amount of ties (20%) were presented, in order to gain 15 problems for each combination of odd and even numbers. No number combination was presented twice in the difficult condition. The position of the larger operand was matched across all tasks. In a two-alternative-forced-choice (2AFC) design the correct solution was presented together with a distracter. For problems containing two 1-digit numbers (easy conditions), the distracter was within the range of plus/minus 2 of the correct solution. For complex problems, consisting either of combinations of 1-digit and 2-digit numbers, or two 2-digit numbers, 50% of the distracters were either within a range of plus/minus 2 of the correct solution (e.g. 56 and 54), or plus/minus 10 of the correct solution (e.g. 42 and 52) each. Exceptions were made for multiplication trials including the number 5: distracters in those trials were within a range of plus/minus 5 of the correct solution. The position of the correct solution on the screen was counterbalanced across all trials within each arithmetic task type. The maximum height of stimuli was 15 mm. Stimuli were presented within a visual angle of less than 4° in Calibri font. Participants responded to the task by pressing one of two buttons of a button box. Trials in the post-test consisted of the same arithmetic tasks as presented in the fMRI sessions, but were presented in a different order to reduce familiarity effects. Tasks of the fMRI session and behavioral posttest were divided into 8 blocks, i.e. 4 blocks per arithmetic operation. Levels of complexity were randomized within each block. 15 practice tasks were presented at the beginning of the fMRI experiment and post-test, which were not repeated during data acquisition. During the practice, participants received feedback about their performance and were encouraged to ask questions. In the fMRI experiment (Fig. 1, A), participants were requested to solve each presented problem within 4 s (jittered exponentially between 3.7 and 5 s to partly de-correlate activation from those of succeeding events) while the task stayed on the screen (e.g. “13 + 26”). After 4 s, a second operand (either plus or minus 1, 2 or 3) was presented with an exponentially jittered duration of 750 ms (time-range of 0.5–2 s). This second task was included in the trial sequence in order to make sure that no motor cortex activation related to button press responses appeared in the crucial calculation interval (cf. Jost et al., 2009). The task was followed by a 2AFC result-display, which was presented for 1750 ms. The 2AFC result display presented the correct answer next to a distracter, and participants were requested to indicate the correct answer by pressing a button on either the left or right side as soon as the result-display appeared. The side on which the correct answer appeared was counter-balanced across all experimental trials. We chose this verification design in order to produce comparable results to those previous neuroimaging studies on arithmetic problem solving, which in particular reported effects of arithmetic operation type and also used a verification procedure. The fixation-cross between trials had a jittered SOA of 1.5–3.5 s. After 50 min of mental calculation, the fMRI session finished with a 10-minute finger-localizer scan. Participants moved or rested their left and right index fingers corresponding to visual cues “Left”, “Right” and “Rest” on the screen. Each cue type was presented 5 ∗ 10 s in a randomized order. During the post-test (Fig. 1, B), reaction times and arithmetic strategies were measured for all tasks of the fMRI experiment. In a 2AFC design, tasks were presented together with two solution options for 4 s each. Participants were instructed to respond as fast and accurately as possible. After solving each task, participants indicated whether they retrieved the answer from memory (i.e. by pressing “known”), or whether they used any kind of procedural calculation strategy (i.e. by pressing “calculated”) based on the following instruction: “After each task you will be asked whether you just knew the answer, or calculated it in several steps. If you just knew the answer, please press the button on the side where the word “known” appears. If instead you calculated it in several steps, press the button on the side where the word “calculated” appears. Calculation of a task in several steps could for example mean:

372


Fig. 1. Trial sequence of the fMRI sessions (A), and behavioral post-fMRI test (B).

25 + 17 = 42 → 25 + 10 + 7 = 42.” Finally, participants' IQ was tested by using the Culture-Fair Test.

are solved via direct memory retrieval than the surface criteria based easy condition.

Analysis of behavioral data

fMRI parameters

Error rates and reaction times of each participant were extracted for all arithmetic conditions from post-test data, as well as behavioral measures obtained in the fMRI sessions. The percentage of items in each rated arithmetic condition, as well as the amount of “mismatch tasks” (e.g. tasks that were defined as “Complex” but rated as “Retrieval”) were analyzed. We further analyzed reaction time curves of conditions, which revealed significant changes due to re-organization of trials by strategy ratings, and compared those with the reaction time curves of surface criteria based task categories. This was done in order to investigate whether reaction time distributions of addition and multiplication tasks become more similar when categories are defined based on strategy ratings. For this, ex-Gaussian distributions were fitted to reaction times (cf. LeFevre et al., 2006; Matzke & Wagenmakers, 2009). The ex-Gaussian distribution results from the convolution of a Gaussian and an exponential distribution and can be described by three parameters: mu and sigma, which correspond to mean and standard deviation of a normal distribution, and tau, the mean of the exponential component, which reflects the tail of the distribution. However, the direct association of the ex-Gaussian distribution parameters with particular cognitive processes was challenged in the past (Matzke & Wagenmakers, 2009). Hence, the parameters will be discussed as descriptive measures only, but in respect to findings of previous studies, using the ex-Gaussian function as a tool to study differences in arithmetic strategy-use across participants (Campbell and Penner-Wilger, 2006; LeFevre et al., 2006; PennerWilger et al., 2002). Ex-Gaussian curves were first fitted to reaction time distributions of each participant. Differences between operation types were then tested for all three Ex-Gaussian parameters by using paired-sample t-tests on the group level. This was done for rating based and surface criteria based analyses separately. Further, reaction times were regressed on the sum of operands, in order to explore the Number Size Effect in retrieval and calculation tasks (cf. LeFevre et al., 1996b). This was done for each individual participant. Paired-sample t-tests on significant differences between rating based and surface criteria based analyses were run for regression coefficients of easy and complex tasks. An increased Number Size Effect for rating based procedural strategies, in contrast to surface criteria based complex tasks, would indicate that the rating based category in fact contains more trials which are solved via a procedural strategy. Conversely, a decreased Number Size Effect for rating based memory retrieval strategies, compared to surface criteria based easy tasks, would indicate that the memory retrieval condition contains more trials that

Participants were scanned in a 3-T Siemens (Munich, Germany) Tim Trio magnetic resonance system using a head coil. Echo-planar imaging (EPI) sequence parameters were TR (inter-scan interval) = 2 s, TE = 30 ms and flip angle = 78°. The functional images consisted of 32 slices covering the whole brain (slice thickness 3 mm, inter-slice distance 0.75 mm, in-plane resolution 3 × 3 mm). Imaging data were processed using SPM5 software (Wellcome Department of Imaging Neuroscience, London, UK; http://www.fil.ion.ucl.ac.uk/spm). Image processing and statistical analyses Images were realigned, coregistered, normalized and finally smoothed. This sequence of pre-processing steps was automated using software tools developed at the Cognition and Brain Sciences Unit (http://imaging.mrc-cbu.cam.ac.uk/imaging/AutomaticAnalysisManual). During the realignment process images were corrected for spatial movements and slice-timing, interpolating images in time to the middle slice using sinc interpolation. The EPI images were coregistered without skull stripping to the structural T1 images by using a mutual information coregistration procedure focused on intra-subject differences: images for the same subject from different scanning sessions were matched in space. The structural MPRAGE MRI (256 × 240 × 160, 1 mm isotropic) was normalized to the 152-subject T1 template of the Montreal Neurological Institute (MNI). The resulting transformation parameters were applied to the coregistered EPI images. During the spatial normalization process, images were resampled with a spatial resolution of 2 × 2 × 2 mm3. Finally, all normalized images were spatially smoothed with a 10-mm full-width half-maximum Gaussian kernel. The same sequence of processing steps was applied to the motor localizer data. First-level statistical contrasts were computed by using the general linear model based on the canonical hemodynamic response function (Friston et al., 1998). Low-frequency noise was removed with a high-pass filter (time constant 128 s for arithmetic sessions; 200 s for motor localizer). For comparison, the design matrix was set up for surface criteria based conditions, with easy and complex tasks defined based on number size, as well as for rating based conditions. For the latter, “retrieval” and “calculation” conditions were defined based on strategy ratings from the post-test (see above). Events were separately modeled for each of the eight fMRI sessions (four with addition tasks, and four with multiplication tasks). Within each session, every experimental trial was modeled separately in the design matrix as follows: the 4-second interval of arithmetic processing was modeled for retrieval and calculation tasks within each session in separate columns, with


duration of approximately 4 s depending on the individual presentation latency. The onset of a task (e.g. visual display of “4 + 3”) was modeled in a separate column across all trials within each session with no duration, in order to account for variance due to the onset of a visual stimulus that is common to all tasks. The additional operand (in-between the arithmetic task and the result-display) was modeled in a separate column with its duration, in order to reduce the influence of motor-preparation effects. Further, error-trials of each session were modeled with their respective durations in a separate column, as well as the first two scans in each session (“dummy-scans”) with their durations, and six movementparameters (the three parameters of translational and rotational movements, respectively). In the rating based analysis, reaction times from the post-test were attached as parametric modulator to all task onsets within each session, to account for performance related activity in stimulus encoding, as well as to retrieval and calculation tasks separately, in order to explain performance related variance within each complexity condition. However, this parametric modulation was done within each session and did not affect contrasts between operation types, considering that addition and multiplication tasks were modeled for separate sessions. Whether or not we included RT as a parametric modulator did not qualitatively affect our results (data not shown). For comparison purposes with previous study designs, no parametric modulator was included in the surface criteria based analysis. In the motor localizer task, we modeled the onsets of left hand movements, right hand movements, and rest-baseline as separate event types with their respective durations. Contrasts for events were defined on a single-subject level first, and then subjected to random-effects analysis for group statistics using SPM5. All analyses were performed for rating based and surface criteria based tasks separately. Whole-brain ANOVAs with the factors “Strategy” (retrieval versus procedural) and “Operation” (addition versus multiplication) were conducted for rating based analyses, and whole-brain ANOVAs with the factors “Complexity” (easy versus complex) and “Operation” (addition versus multiplication) were performed for surface criteria based analyses. Regions of interest (ROIs) were defined and analyzed using the Marsbar utility (Brett et al., 2002). ROIs were extracted from the motor localizer, MD network regions (Duncan, 2010), and parietal areas, involved in numerical perception and magnitude processing (Dehaene et al., 2003). Mean activation was extracted for spherical volumes of 10 mm radius. Parameter estimates were subjected to ANOVAs with the factors “Strategy” (retrieval versus procedural) and “Operation” (addition versus multiplication) for rating based analyses, and to ANOVAs with the factors “Complexity” (easy versus complex) and “Operation” (addition versus multiplication) for surface criteria based analyses. Results In the following, we will first confirm the validity of our strategy ratings on the basis of a detailed analysis of our behavioral data. We will then present fMRI analyses for the whole-brain level, as well as for a hypothesis-guided selection of ROIs. Crucially, strategy ratings from the behavioral post-test were used in order to define categories for easy and complex problems in fMRI data analyses. Results from rating based analyses were compared with conventional analyses, in which complexity levels were defined based on surface criteria (e.g. size of numbers and performance measures from pilot-studies). Behavioral data Mean error rate was 6.6% (SD 4.5) in the post-fMRI test, and 4.8% (SD 1.4) in the fMRI experiment. We have analyzed reaction times as well as strategy ratings from the post-test. Note that reaction times from the fMRI sessions do not directly reflect performance, because all tasks were presented for a jittered 4 second interval, while reaction times from the post-test indicate

373

solution times, i.e. when participants confirmed either one of the two options which were presented together with the task on screen. Behavioral results from the post-test were analyzed separately for conditions based on strategy ratings, versus for conditions based on surface criteria. On average, 12 multiplication tasks (SD = 10.4), and 6 (SD = 7.3) addition tasks were re-categorized due to individual strategy ratings. This means that slightly more multiplication tasks were re-categorized due to strategy ratings than addition tasks. However, considering the overall amount of 120 addition and multiplication tasks each, it is unlikely that this re-organization due to ratings had a significant impact on statistical power in analyses of operation type effects. Mean reaction times from the post-fMRI test are summarized in Fig. 2 (Panels A and B). Importantly, mean reaction times did not reveal differences between operation types when complexity levels were defined by participants' individual ratings (Fig. 2, A). However, significant differences between operation types were observed for complex tasks in surface criteria based analyses (Fig. 2, B) (Table 1, Panel A). Hence, strategy ratings had an impact on mean reaction times of complex tasks: mean reaction times for rated procedural arithmetic strategies did not differ for addition and multiplication, while surface criteria based complex tasks revealed differences between addition and multiplication. We further explored whether strategy ratings may also lead to an improved match in reaction time distributions of complex addition and multiplication tasks. These conditions differed in their mean reaction times in surface criteria based analyses, but not in rating based analyses. Ex-Gaussian functions were fitted to reaction times of both analyses (Figs. 2, B and C). Ex-Gaussian functions (Lacouture and Cousineau, 2008) are a hybrid of exponential and Gaussian functions which can be used to model distributions that are positively skewed, such as reaction time distributions. They model a reaction time distribution by three parameters: “mu”, which is a measure for central tendency (corresponding to the mean of a Gaussian distribution), “sigma” (corresponding to the standard deviation), and “tau” (which reflects the tail of the distribution). Most importantly, no differences in ex-Gaussian parameters were observed between addition and multiplication in rating based conditions, but mu differed significantly when tasks were defined based on surface criteria (Table 1, Panel B). Hence, ex-Gaussian analyses confirmed results from analyses of mean reaction times: operation type effects could be observed for surface criteria based conditions, but were absent in rating based conditions. Because self-report measures are subjective and may be biased (Kirk and Ashcraft, 2001), we also validated our categorization of arithmetic problems by analyzing the impact of strategy ratings on the Number Size Effect (i.e. that large problems, such as 7 + 8 take longer to solve than smaller problems, such as 3 + 4) (Ashcraft, 1992). The Number Size Effect refers to the finding that reaction times usually increase when the size of the problem (e.g. defined as the sum of the operands) increases (cf. Ashcraft, 1992). However, LeFevre and co-workers have shown that this is more the case when participants solve a problem using a procedural strategy, such as counting, while for simple problems that are directly retrieved from memory, the effect is absent or smaller (cf. LeFevre et al., 1996b). We therefore tested whether easy problems showed a smaller Number Size Effect in rating based analyses, and whether complex problems (procedural strategies) showed a larger Number Size Effect in rating based analyses, when compared with surface criteria based analyses. The Number Size Effect was analyzed in a linear regression model, in which reaction times were regressed on the sum of operands of each arithmetic task, such as in analyses of previous studies (LeFevre et al., 1996a,b). In paired-sample t-tests, rating based analyses revealed a significant larger Number Size Effect for procedural strategies in contrast to surface criteria based complex tasks. A significant smaller Number Size Effect was observed for rating based memory retrieval strategies, compared to surface criteria based easy tasks (Table 1, Panel C). This was also true when only single-digit numbers were considered for analyses, thus showing that the current

374


Fig. 2. Summary of behavioral results from the post-fMRI test. (A) Mean reaction times for rating based task categories. (B) Mean reaction times for surface criteria based task categories. (C, D) Ex-Gaussian functions, fitted to reaction time distributions of addition and multiplication tasks (here presented for complex task conditions only), for the rating based analysis (C) and surface criteria based analysis (D) separately. Reaction time distributions of addition and multiplication tasks were better matched in the rating based analysis. The red arrow (D) highlights the larger difference between distributions of addition and multiplication tasks in the surface criteria based analysis.

effects did not only depend on the specific range of number sizes in the predictor variable. The results indicate that the rating based procedural condition contains more trials which are solved via a procedural strategy, in contrast to the surface criteria based complex condition. Conversely, the rating based memory retrieval condition contains more

Table 1 Analyses of mean reaction times (Panel A), ex-Gaussian analyses (Panel B), and regression analyses on the Number Size Effect (Panel C). Contrasts

t(25)

p-Value

Panel A: operation type effects in mean reaction times Easy Add rating based (RB)–easy Mul RB Complex Add RB–complex Mul RB Easy Add (surface criteria) SC–easy Mul SC Complex Add SC–complex Mul SC

– – – 7.40

n.s. n.s. n.s. .000

Panel B: operation type effects in ex-Gaussian parameters Complex Add rating based (RB)–complex Mul RB Complex Add surface criteria (SC)–complex Mul SC

– 4.74

n.s. .000 for μ

Panel C: Number Size Effect in rating based vs. surface criteria based analyses All tasks Easy rating based (RB)–surface criteria (SC) −9.92 Easy Add RB-SC −7.30 Easy Mul RB–SC −10.24 Complex RB–SC 2.21 Complex Add RB–SC 4.93 Complex Mul RB–SC 7.23 Tasks including digits 1–9 only All single-digit tasks RB–SC −3.680 Add RB–SC −3.433 Mul RB–SC −2.751

.000 .000 .000 .036 .000 .000 .001 .002 .011

trials that are solved via direct memory retrieval, in contrast to the surface criteria based easy condition. fMRI whole-brain results Our main goal was to investigate differences between arithmetic operation types in sensory–motor regions, parietal regions of numerical processing, as well as regions of the multiple-demand network. In a first step, we determined the reliability of activation in these regions in a whole-brain analysis. For the rating based analysis, an ANOVA with the two within-subject factors “Strategy” (retrieval versus procedural) and “Operation” (addition versus multiplication) revealed highly significant main effects for Strategy (Fig. 3, A). No main effect of Operation, and no Operation-by-Strategy interaction was observed (Figs. 3, B and C), neither on a false-discovery-rate (FDR) corrected threshold, nor on a p(uncorrected) b .001 threshold. Effects of Strategy were found in line with our predictions: the contrast of fact retrieval N procedural strategies revealed bilateral angular gyrus activation (Brodmann's area (BA) 39), bilateral anterior dorsal-lateral prefrontal cortex and left orbito-medial prefrontal regions (BA 9 and 11), as well as right ventral-posterior cingulate cortex (BA 23). The reversed contrast of procedural strategies N fact retrieval showed activation in the left posterior superior parietal lobule (BA 7), in bilateral ventro-lateral prefrontal cortex and left Borca's area (BA 47 and 44), right dorsal-lateral prefrontal cortex (BA 46), left dorsal-anterior cingulate cortex (BA 32), and left para-hippocampus regions (BA 27) (Table 2). In order to replicate results from previous studies, we further analyzed the data with complexity categories based on surface criteria. We directly compared the rating based and surface criteria based analyses


375

Fig. 3. Whole-Brain ANOVA results for the rating based analysis. A: Main effect of Strategy. B: main effect of Operation Type. C: Operation-by-Strategy interaction. Note that results in A are presented at a family-wise error-corrected threshold, while those in panels B and C are displayed at a lenient uncorrected threshold and not considered reliable.

with each other, in order to demonstrate that both analyses were reliably different from each other. Rating based and surface criteria based analyses were contrasted in a whole-brain within-subject ANOVA with the factors “Analysis” (surface criteria based versus rating based), “Operation Type” (addition versus multiplication), and “Complexity” (complex versus easy conditions). An FDR-corrected significant Operation-byComplexity-by-Analysis interaction confirmed differences between analyses in right dorsal anterior cingulate cortex, right ventro-lateral PFC, right premotor cortex, left cerebral cortex, and left angular gyrus (see Table 3, Panel A; Fig. 4). Post-hoc tests revealed an FDR-corrected significant Operation-by-Complexity interaction for surface criteria based analyses in bilateral angular gyrus, left cerebral cortex, left dorsal-lateral prefrontal cortex, and left premotor and supplementary motor cortex (see Table 3, Panel B for details). Opposed to this, no reliable operation type effects were observed for the rating based analysis (see above). ROI analyses in sensory–motor regions, MD network, and parietal cortex The whole-brain analysis of our rating based data revealed no corrected-significant effects of Operation Type, while FDR-corrected

significant Operation-by-Complexity interactions were found in whole-brain analyses with surface criteria based tasks in frontal, parietal, and motor regions. We therefore performed a more detailed, and possibly more sensitive, hypothesis-guided ROI analyses in sensory– motor regions, parietal areas and the multiple-demand network, in order to rule out that differences in statistical power between analyses (due to for example re-organization of tasks by strategy ratings) caused the absence of operation type effects in rating based whole-brain analyses. For sensory–motor ROIs, peaks of activation were extracted from the motor-localizer for contrasts “Left Hand N Rest” and “Right Hand N Rest” (FDR-corrected p b 0.05). Six regions in bilateral primary motor cortex (BA 4) [− 36, − 12, 58; 52, −8, 54], bilateral premotor cortex (BA 6) [− 6, − 2, 56; 6, − 2, 58], and bilateral horizontal intraparietal sulcus (hIPS) (BA 40) [−36, −36, 40; 42, −38, 44] were selected. The following regions of the multiple-demand (MD) network (Duncan, 2010) were selected to investigate executive control processes during problem solving: right intraparietal sulcus (IPS) [37, −56, 41], right inferior frontal sulcus (IFS) [41, 23, 29], right anterior insula/adjacent frontal operculum (AI/FO) [35, 18, 2], dorsal anterior cingulate cortex (ACC) [0, 31, 24], and pre-supplementary motor area (SMA) [0, 18, 50], as well as the

376


Table 2 Strategy effect in rating based analysis, threshold p(FWE) b

.05, cluster size N

100 voxels.

Strategy effects in rating based analysis Region

Coordinate (MNI)

Cluster size (kE)

p(FWE)

Arithmetic fact retrieval N procedural strategies R BA 39 (angular gyrus (AG)) [58, −58, 42] L BA 39 (AG) [−58, −62, 32] L BA 11 (orbital–medial PFC) [−2, 28, −8] R BA 23 (ventral-post. cingulate cortex) [6, −50, 30] R BA 9 (anterior dorsal-lateral PFC) [14, 42, 50] L BA 9 (anterior dorsal-lateral PFC) [−8, 46, 46]

729 667 779 441 112 139

.000 .000 .000 .002 .002 .008

Procedural strategies N arithmetic fact retrieval L BA 7 (PSPL) [−30, −60, 50] L BA 32 (dorsal-anterior cingulate cortex) [−12, 12, 46] L BA 44 (ventro-lateral PFC (Broca's area)) [−54, 10, 28] R BA 47 (ventro-lateral PFC) [34, 26, 0] L BA 27 (para-hippocampus) [−22, −32, 6] R BA 46 (dorsal-lateral PFC) [34, 48, 18] L BA 47(ventro-lateral PFC) [−24, 28, −2]

18,005 3559 1413 923 895 537 297

.000 .000 .000 .000 .000 .000 .000

right rostrolateral prefrontal cortex (RPFC) [21, 43, −10]. Mean coordinates for parietal regions, reported as specifically involved in aspects of numerical processing, were taken from the review article by Dehaene et al. (2003) in the bilateral hIPS [−44, −48, 47; 41, −47, 48], bilateral PSPL [−22, −68, 56; 15, −63, 56], and left AG gyrus [−41, −66, 36]. No main effect of Operation and no Strategy-by-Operation interaction were found for the rating based analysis in ANOVAs with the factors “Strategy” (retrieval versus procedural) and “Operation” (addition versus multiplication). Significant main effects of Strategy occurred in sensory–motor regions, multiple-demand network, and parietal regions. While procedural strategies revealed stronger activations in a broad frontal–parietal network, significantly more activation for arithmetic fact retrieval was observed in the left AG. This is in line with our results from whole-brain analyses. Main effects of Strategy from ANOVAs, as well as separate t-statistics for addition and multiplication tasks, are reported in Table 4. In order to compare our results with those of previous studies which did not use any strategy self-reports, we analyzed activation within ROIs for surface criteria based stimulus categories as well. This analysis revealed differences between operation types in parietal and motor regions as well as in the MD network. In parietal regions, a significant Complexity-by-Operation interaction was found in the bilateral PSPL

(F(1,25) = 27.44, p b .000; F(1,25) = 11.34, p b .001 for left and right, respectively) and left AG (F(1,25) = 12.97, p b .001). A main effect of Operation revealed in the right PSPL (F(1,25) = 6.02, p b .021) and left AG (F(1,25) = 4.28, p b .049). Post-hoc comparisons revealed stronger activation for addition than multiplication tasks in right PSPL (t(25) = 2.45, p b .021), and stronger activation for multiplication than addition tasks in left AG (t(25) = − 2.07, p b .049). In regions of the MD network, significant Complexity-by-Operation interactions were observed in right AI/FO (F(1,25) = 5.29, p b .030), SMA (F(1,25) = 6.76, p b .016), and left ACC (F(1,25) = 4.92, p b .036). In motor-localizer regions, significant Complexity-by-Operation interactions were found in the left primary motor cortex (F(1,25) = 11.60, p b .002), and in the bilateral hIPS (F(1,25) = 6.39, p b .018; F(1,25) = 4.30, p b .048 for left and right, respectively). A main Operation effect was observed in right hIPS (F(1,25) = 4.28, p b .049). Post-hoc tests revealed stronger activation for addition than multiplication tasks in this region (t(25) = 2.07, p b .049). For direct comparison of analyses, activation for complexity contrasts of rating based and surface criteria based analyses is plotted in Fig. 5. The figure reveals a similar strength of activations for complexity contrasts in both analyses, suggesting that the absence of operation type effects in rating based analyses is unlikely due to a smaller overall statistical power. Discussion We asked to what degree brain activation during arithmetic problem solving is determined by general task complexity and operation type. We therefore orthogonally varied operation type and task complexity in our fMRI study, and defined complexity based on individual subjects' strategy ratings. These strategy ratings were validated by means of the well-established number size effect on reaction times, as well as by modeling reaction time distributions with ex-Gaussian functions. In contrast to previous studies, we did not find any reliable differences between operation types when controlling for individual strategies, although we could replicate previous effects when defining task complexity by surface-features (e.g. number size). Hence, our analyses do not support predictions of embodied numerical cognition theories with respect to a specific “grounding” of basic operation types in sensory and motor systems. However, differences between procedural and fact-retrieval strategies in fronto-parietal and sensory–motor regions support the idea that verbal and sensory–motor derived concepts may play a role in general problem solving.

Table 3 Panel A: results from comparison of rating based and surface criteria based analyses in within-subject ANOVA with the factors “Operation Type” (addition versus multiplication), “Complexity” (easy versus complex condition), and “Analysis” (rating based versus surface criteria based analysis). Panel B: activation for Complexity-by-Operation interaction in surface criteria based analysis, threshold p(FDR) b .05, cluster sized N 100 voxels. Multiple-comparison correction: ** = p(FWE); * = p(FDR). No significant operation type effects were observed for the rating based analysis. Panel A: comparison of rating based and surface criteria based analyses Region

Coordinate (MNI)

Operation-by-Complexity-Analysis interaction R BA 32 (dorsal anterior cingulate cortex) R BA 47 (ventro-lateral PFC) R BA 6 (premotor cortex) L BA 48 (cerebral cortex) L BA 39 (angular gyrus (AG))

[10, 26, 26] [34, 30, 4] [6, 14, 62] [−28, 24, 4] [−56, −58, 42]

Cluster size (kE) 191 218 292 49 10

Significance .037** .043** .032* .034* .043*

Panel B: post-hoc tests of operation type effects for each type of analysis Region

Coordinate (MNI)

Cluster size (kE)

Significance

Operation-by-Complexity interaction in surface criteria based analysis L BA 39 (AG) L BA 48 (cerebral cortex) R BA 39 (AG) L BA 46 (dorsal-lateral PFC) L BA 6 (premotor cortex) L BA 8 (supplementary motor cortex)

[−44, −75, 32] [−26, 16, 6] [48, −76, 32] [−44, 52, −6] [−52, −8, 50] [−22, 22, 46]

1081 2098 334 379 288 422

.018** .043** .000* .001* .002* .014*


377

Fig. 4. Comparison of rating based and surface criteria based analyses. Operation-by-Complexity-by-Analysis interaction from within-subject ANOVA with the factors “Operation Type” (addition versus multiplication), “Complexity” (easy versus complex task condition), and “Analysis” (rating based versus surface criteria based analysis).

The impact of participants' strategy ratings We analyzed our data in two different ways: Categorizing tasks as simple or complex based on individual participants' strategy ratings (i.e. “retrieval” versus “procedural” strategies), or based on surface criteria, in this case number size. Previously reported effects of arithmetic task complexity were replicated in both versions of our data analyses. However, effects of arithmetic operation types could only be replicated in analyses with complexity levels defined using surface criteria. Surface criteria based analyses revealed stronger activation for multiplication tasks in the left angular gyrus regions (Grabner et al., 2009a), and right parietal cortices (Rosenberg-Lee et al., 2011). Stronger activations for addition tasks were observed in parietal networks associated with visuo-spatial processing (Knops et al., 2009a; Zhou et al., 2006b), and in motor regions (Badets et al., 2010; Klein et al., 2011). However, these effects did not appear in our rating based analysis which controlled for individual strategies. This suggests that operation type effects in the Table 4 Significant main effects of “Strategy” (procedural N retrieval strategies) from ANOVAs, and separate t-statistics for addition and multiplication tasks, which correspond to bar graphs in Fig. 5. P-Values with asterisk are Bonferroni-corrected significant across all analyzed ROIs. Strategy effects in regions of interest of rating based analysis Main Effect Strategy

Addition ComplexNEasy

Multiplication ComplexNEasy

Regions

F(1,25)

p

t(25)

p

t(25)

p

Motor-localizer regions Left primary motor Right primary motor Left p re motor Right premotor Left hi PS Right hIPS

30.33 … 38.33 36.17 101.45 114.15

000* n.s. .000* .000* .000* .000*

5.127 … 4.807 4.526 8.699 7.813

.000* n.s. .000* .000* .000* .000*

4.128 … 4.624 4.246 7.111 7.674

.000* n.s. .000* .000* .000* .000*

Multiple-demand network regions Right IPS 46.26 Right IFS 15.56 Right AIFO 51.36 Right RPFC 6.42 SMA 57.40 ACC 8.03

.000* .001* .000* .018 .000* .009

5.773 3.160 5.164 … 6.472 …

.000* .004 .000* n.s. .000* n.s.

4.711 3.669 5.934 2.864 6.093 2.692

.000* .001* .000* .009 .000* .013

Parietal regions Left PSPL Right PSPL Left hi PS Right hIPS Left AG

.000* .000* .000* .000* .024

14.009 8.720 9.114 7.430 −3.655

.000* .000* .000* .000* .001*

8.628 9.968 6.484 7.373 …

.000* .000* .000* .000* n.s.

165.77 131.81 84.25 116.56 5.77

surface criteria based analysis were confounded by differences in task complexity, rather than inherent differences between addition and multiplication tasks. Stronger activation for procedural strategies than arithmetic fact retrieval was observed in prefrontal cortices, motor areas, posteriorsuperior parietal lobel (PSPL) and intraparietal sulcus (IPS), while more activation for arithmetic fact retrieval was found in bilateral angular gyrus. Effects for procedural strategies are in line with previous findings for contrasts of complex against easy arithmetic tasks (cf. Fehr et al., 2007; Grabner et al., 2009a; Gruber et al., 2001; Hanakawa et al., 2002, 2003; Jost et al., 2009; Menon et al., 2000), and effects in angular gyrus have been reported for easy arithmetic tasks in several other studies (Grabner et al., 2009a,b; Jost et al., 2011; Zago and Tzourio-Mazoyer, 2002). However, in contrast to previous research (Grabner et al., 2009a), no interaction with operation type, and no main effect of arithmetic operation was found in this region for rating based analyses. Operation type effects in surface criteria based analyses of the current study can be interpreted as differences in procedural complexity: More activation in the left AG for complex multiplication (cf. Grabner et al., 2009a), and more activation in PSPL and sensory–motor regions for complex addition in surface criteria based analyses, may be due to strategy-differences. This is in line with the idea that the left AG supports retrieval processes, while the PSPL and sensory–motor regions activate during procedural strategy-use. This indicates that self-report measures in the current study have de-confounded effects of arithmetic strategies from effects of arithmetic operation types. Hence, previously reported operation type effects in brain activation are not indicative of operation type specific processes or representations. Rather, they may reflect differences in individually experienced levels of complexity. Addition and multiplication may rely on the same representations, processes, and brain systems for fact retrieval and multi-step procedures, but differences in previous brain imaging studies may have emerged due to invalid categorization of addition and multiplication into “easy” and “complex”. Validation of strategy self-reports Our detailed analysis of behavioral data confirmed the superiority of our rating based approach compared to traditional surface-based approaches. The validity of individual ratings for problem solving strategies has been investigated in behavioral (LeFevre et al., 2006; SmithChant and LeFevre, 2003) and neuroimaging research (Grabner & De Smedt, 2011). However, because this approach has been frequently challenged (cf. Kirk & Ashcraft, 2001), we compared self-reports with reaction time measures in two ways: the Number Size Effect was

378


Fig. 5. Results of ROI analyses for rating based (RB) and surface criteria (SC) based analyses. Bar graphs display the mean activation of each complexity contrasts (procedural N retrieval strategies/easy versus complex surface criteria), with the standard error of the mean difference between addition (Add) and multiplication (Mul). Blue bars refer to ROIs taken from Dehaene et al. (2003), yellow bars to ROIs extracted from the finger localizer, and red bars to the multiple-demand network (Duncan, 2010). Significant differences between operation types are indicated by an asterisk. All regions not labeled as ‘not significant’ (n.s.) exceeded the significance threshold of p b .05. AI/FO = anterior insula/adjacent frontal operculum; ACC = anterior cingulate cortex; SMA = supplementary motor area; RPFC = rostrolateral prefrontal cortex; IPS = intraparietal sulcus; PSPL = posterior superior parietal lobule; AG = angular gyrus; MC = motor cortex; PMC = premotor cortex. Medial regions are displayed on the lateral surface for purposes of visual simplicity. Effects of operation type are only observed in surface criteria based analyses, while the over-all amplitude of the signal does not differ for rating based and surface criteria based analyses.

analyzed using linear regression (reaction times regressed on the sum of presented operands), and ex-Gaussian functions were fitted to reaction time distributions.

Previous work has shown that the Number Size Effect, i.e. the increase of reaction times for simple arithmetic problems with the magnitude of the result (Ashcraft, 1992, 1995; Groen and Parkman, 1972), is


379

an indicator of procedural rather than retrieval strategies (LeFevre et al., 1996a). We could show that in rating based analyses, compared to surface criteria based analyses, the Number Size Effect decreased for simple problems but increased for complex problems. This indicates that we indeed separated retrieval from procedural strategies through participants' self-reports (cf. LeFevre et al., 1996a,b). This could be further confirmed in an analysis of reaction time distributions using ex-Gaussian functions. Parameters of the ex-Gaussian distribution did not differ for operation types in the rating based analysis, but did so in the surface criteria based analysis.

motor systems may therefore support working memory processes during complex arithmetic problem solving. With respect to sub-goal sequencing, it has been suggested that premotor regions are involved in sequencing of numbers (Hanakawa et al., 2003; Honda et al., 2002; Kansaku et al., 2007), processing of syntactic structures (Friederici, 2006; Schubotz and von Cramon, 2003), and that they are sensitive to the complexity of sequences (for review, see Schubotz and von Cramon, 2002). Cortical motor systems may therefore play a role in the sequencing of sub-processes during complex arithmetic problem solving.

Neural networks of procedural arithmetic strategies

Conclusion

Our findings question previous interpretations that addition and multiplication rely on different brain systems. Instead, we argue that in order to meaningfully interpret differences between operation types, one needs to consider addition and multiplication within general frameworks of problem solving, and perform a more fine-grained analysis of the sub-processes involved. Specific effects for procedural strategies have been observed in three different neural systems in our rating based analyses, which might subserve different aspects of abstract problem solving, irrespectively from the type of arithmetic operation: a) a frontal–parietal system of general executive control, b) a lateral prefrontal and temporal network for the retrieval of learned rules and strategies from long-term memory, and c) a sensory–motor network, potentially involved in working memory processes and sequencing of sub-goals during strategy execution. With respect to the frontal–parietal network of executive control, it has been observed that procedural arithmetic strategies produced strong activation in all regions of the multiple-demand (MD) network (Duncan, 2010), including the IPS, inferior frontal sulcus (IFS), anterior insular and frontal operculum (AI/FO), pre-supplementary motor regions (pre-SMA), and rostral prefrontal cortex (RPFC). Arithmetic complexity effects have previously been reported in frontal cortex (cf. Arsalidou & Taylor, 2011). Evidence from neuroimaging and neuropsychological studies suggests that processes of cognitive control, such as required in tests on fluid intelligence, evoke strong activation in MD network regions (Duncan et al., 2000; Woolgar et al., 2010). With respect to memory retrieval of arithmetic strategies and facts (cf. Dowker, 2005), procedural strategies in the current study produced highly reliable activation in whole-brain analyses in the bilateral ventro-lateral prefrontal cortex (VLPFC) and around Broca's area (BA 47 and 44), as well as in the right ventral dorsal-lateral prefrontal cortex (DLPFC) (BA 46). These regions do not belong to the MD network, and recent evidence from human (Bunge, 2004; Bunge et al., 2003; Rowe et al., 2000) and monkey (cf. Bongard and Nieder, 2010) research suggests their involvement in rule-selection and retrieval of abstract rules from long-term memory. In addition to activation in general problem solving networks, the strong involvement of left primary motor and bilateral premotor regions in procedural arithmetic strategies was a striking finding in the current study. Pre-motor cortex has previously been found to be sensitive to complexity effects in arithmetic (Arsalidou & Taylor, 2011; Gruber et al., 2001; Jost et al., 2009; Menon et al., 2000). On the basis of these results, we argue that sensory–motor derived concepts may support cognitive processes relevant for procedural arithmetic strategies, such as working memory and sub-goal sequencing. With respect to working memory, van Dijck and Fias (2011) have shown that spatial conceptualizations of numbers (as documented by the SNARC effect; Dehaene et al. (1993)) can be explained by the organization of numbers in working memory, rather than a semantic representation of numbers on a mental number line. Right premotor cortex activation has been observed in spatial working memory tasks (Sadato et al., 1996; Smith and Jonides, 1999) and Shebani and Pulvermüller (2013) have shown that motor system activation can interfere with performance on short-term memory tests for action-words. Sensory–

Our results do not support the view that different arithmetic operations inherently rely on different brain systems, as proposed by theories of embodied numerical cognition, but they suggest that different types of arithmetic strategies are specifically supported by brain systems involved in executive function, language, and sensory–motor processes. Hence, certain types of general abstract problem solving strategies might be still “embodied”. The specific role of cortical motor systems in, for example, the sequencing of sub-steps of a problem solving process should be a topic for future investigations. Therefore, by combining behavioral measures of performance and strategy-use with neuroimaging data, arithmetic may be a powerful paradigm to systematically investigate the spatio-temporal brain dynamics during complex problem solving, taking into account inter-individual differences in skill and practice. Acknowledgments This work was supported by the Medical Research Council UK (O. Hauk: MC-A060-5PR40). N. Tschentscher was supported by the Gates Cambridge Scholarship. We would like to thank Dr. John Duncan for comments on the interpretation of our results, and Dr. Peter Watson for statistical advice. References Anderson, J.R., Betts, S., Ferris, J.L., Fincham, J.M., 2011. Cognitive and metacognitive activity in mathematical problem solving: prefrontal and parietal patterns. Cogn. Affect. Behav. Neurosci. 11 (1), 52–67. Andres, M., Pelgrims, B., Michaux, N., Olivier, E., Pesenti, M., 2010. Role of distinct parietal areas in arithmetic: an fMRI-guided TMS study. NeuroImage 54 (4), 3048–3056. Andres, M., Michaux, N., Pesenti, M., 2012. Common substrate for mental arithmetic and finger representation in the parietal cortex. NeuroImage 62 (3), 1520–1528. Arsalidou, M., Taylor, M.J., 2011. Is 2 + 2 = 4? Meta-analyses of brain areas needed for numbers and calculations. NeuroImage 54 (3), 2382–2393. Ashcraft, M.H., 1992. Cognitive arithmetic: a review of data and theory. Cognition 44 (1–2), 75–106. Ashcraft, M.H., 1995. Cognitive psychology and simple arithmetic: a review and summary of new directions. Math. Cogn. 1, 3–34. Badets, A., Pesenti, M., Olivier, E., 2010. Response–effect compatibility of finger-numeral configurations in arithmetical context. Q. J. Exp. Psychol. 63 (1), 16–22. Bongard, S., Nieder, A., 2010. Basic mathematical rules are encoded by primate prefrontal cortex neurons. Proc. Natl. Acad. Sci. U. S. A. 107 (5), 2277–2282. Brett, M., Anton, J.L., Valabregue, R., Poline, J.B., 2002. Region of interest analysis using an SPM toolbox. Paper Presented at the 8th International Conference on Functional Mapping of the Human Brain, Sendai, Japan. Bunge, S.A., 2004. How we use rules to select actions: a review of evidence from cognitive neuroscience. Cogn. Affect. Behav. Neurosci. 4 (4), 564–579. Bunge, S.A., Kahn, I., Wallis, J.D., Miller, E.K., Wagner, A.D., 2003. Neural circuits subserving the retrieval and maintenance of abstract rules. J. Neurophysiol. 90 (5), 3419–3428. Campbell, J.I.D., Penner-Wilger, M., 2006. Calculation latency: the mu of memory and the tau of transformation. Mem. Cogn. 34 (1), 217–226. Cattell, R.B., Cattell, A.K.S., 1960. Handbook for the Individual or Group Culture Fair Intelligence Test. Testing Inc., Champaign, IL. Chochon, F., Cohen, L., van de Moortele, P.F., Dehaene, S., 1999. Differential contributions of the left and right inferior parietal lobules to number processing. J. Cogn. Neurosci. 11 (6), 617–630. Dehaene, S., 1992. Varieties of numerical abilities. Cognition 44 (1–2), 1–42. Dehaene, S., Cohen, L., 1995. Towards an anatomical and functional model of number processing. Math. Cogn. 1, 83–120. Dehaene, S., Cohen, L., 1997. Cerebral pathways for calculation: double dissociation between rote verbal and quantitative knowledge of arithmetic. Cortex 33 (2), 219–250. Dehaene, S., Cohen, L., 2007. Cultural recycling of cortical maps. Neuron 56 (2), 384–398.

380


Dehaene, S., Bossini, S., Giraux, P., 1993. The mental representation of parity and number magnitude. J. Exp. Psychol. Gen. 122 (3), 371–396. Dehaene, S., Piazza, M., Pinel, P., Cohen, L., 2003. Three parietal circuits for number processing. Cogn. Neuropsychol. 20 (3–6), 487–506. Dowker, A., 2005. Individual Differences in Arithmetic: Implications for Psychology, Neuroscience and Education. Psychology Press, Hove. Duncan, J., 2010. The multiple-demand (MD) system of the primate brain: mental programs for intelligent behaviour. Trends Cogn. Sci. 14 (4), 172–179. Duncan, J., Seitz, R.J., Kolodny, J., Bor, D., Herzog, H., Ahmed, A., et al., 2000. A neural basis for general intelligence. Science 289 (5478), 457–460. Fehr, T., Code, C., Herrmann, M., 2007. Common brain regions underlying different arithmetic operations as revealed by conjunct fMRI-BOLD activation. Brain Res. 1172, 93–102. Fischer, M.H., 2012. A hierarchical view of grounded, embodied, and situated numerical cognition. Cogn. Process. 13 (1), 161–164. Friederici, A.D., 2006. Broca's area and the ventral premotor cortex in language: functional differentiation and specificity. Cortex 42 (4), 472–475. Friston, K.J., Fletcher, P., Josephs, O., Holmes, A., Rugg, M.D., Turner, R., 1998. Event-related fMRI: characterizing differential responses. NeuroImage 7 (1), 30–40. Grabner, R.H., De Smedt, B., 2011. Neurophysiological evidence for the validity of verbal strategy reports in mental arithmetic. Biol. Psychol. 87 (1), 128–136. Grabner, R.H., Ansari, D., Reishofer, G., Stern, E., Ebner, F., Neuper, C., 2007. Individual differences in mathematical competence predict parietal brain activation during mental calculation. NeuroImage 38 (2), 346–356. Grabner, R.H., Ansari, D., Koschutnig, K., Reishofer, G., Ebner, F., Neuper, C., 2009a. To retrieve or to calculate? Left angular gyrus mediates the retrieval of arithmetic facts during problem solving. Neuropsychologia 47 (2), 604–608. Grabner, R.H., Ischebeck, A., Reishofer, G., Koschutnig, K., Delazer, M., Ebner, F., et al., 2009b. Fact learning in complex arithmetic and figural-spatial tasks: the role of the angular gyrus and its relation to mathematical competence. Hum. Brain Mapp. 30 (9), 2936–2952. Groen, G.J., Parkman, J.M., 1972. A chronometric analysis of simple addition. Psychol. Rev. 79, 329–343. Gruber, O., Indefrey, P., Steinmetz, H., Kleinschmidt, A., 2001. Dissociating neural correlates of cognitive components in mental calculation. Cereb. Cortex 11 (4), 350–359. Hanakawa, T., Honda, M., Sawamoto, N., Okada, T., Yonekura, Y., Fukuyama, H., et al., 2002. The role of rostral Brodmann area 6 in mental-operation tasks: an integrative neuroimaging approach. Cereb. Cortex 12 (11), 1157–1170. Hanakawa, T., Honda, M., Okada, T., Fukuyama, H., Shibasaki, H., 2003. Differential activity in the premotor cortex subdivisions in humans during mental calculation and verbal rehearsal tasks: a functional magnetic resonance imaging study. Neurosci. Lett. 347 (3), 199–201. Hauk, O., Tschentscher, N., 2013. The body of evidence: what can neuroscience tell us about embodied semantics? Front. Cogn. Sci. 4 (50). Honda, M., Hanakawa, T., Sawamoto, N., 2002. Differential activation of rostral part of premotor cortex in mental operation. Society for Neuroscience Abstract Viewer and Itinerary Planner (Abstract No. 373.375). Jost, K., Beinhoff, U., Hennighausen, E., Rosler, F., 2004. Facts, rules, and strategies in single-digit multiplication: evidence from event-related brain potentials. Cogn. Brain Res. 20 (2), 183–193. Jost, K., Khader, P., Burke, M., Bien, S., Rosler, F., 2009. Dissociating the solution processes of small, large, and zero multiplications by means of fMRI. NeuroImage 46 (1), 308–318. Jost, K., Khader, P.H., Burke, M., Bien, S., Rosler, F., 2011. Frontal and parietal contributions to arithmetic fact retrieval: a parametric analysis of the problem-size effect. Hum. Brain Mapp. 32 (1), 51–59. Kansaku, K., Carver, B., Johnson, A., Matsuda, K., Sadato, N., Hallett, M., 2007. The role of the human ventral premotor cortex in counting successive stimuli. Exp. Brain Res. 178 (3), 339–350. Kirk, E.P., Ashcraft, M.H., 2001. Telling stories: the perils and promise of using verbal reports to study math strategies. J. Exp. Psychol. Learn. Mem. Cogn. 27 (1), 157–175. Klein, E., Moeller, K., Willmes, K., Nuerk, H.-C., Domahs, F., 2011. The influence of implicit hand-based representations on mental arithmetic. Front. Psychol. 2, 197. Klein, E., Moeller, K., Glauche, V., Weiller, C., Willmes, K., 2013. Processing Pathways in Mental Arithmetic - Evidence from Probabilistic Fiber Tracking. PLoS ONE 8 (1), e55455. Knops, A., Thirion, B., Hubbard, E.M., Michel, V., Dehaene, S., 2009a. Recruitment of an area involved in eye movements during mental arithmetic. Science 324 (5934), 1583–1585. Knops, A., Viarouge, A., Dehaene, S., 2009b. Dynamic representations underlying symbolic and nonsymbolic calculation: Evidence from the operational momentum effect. Atten. Percept. Psychophys. 71 (4), 803–821.

Kong, J., Wang, C., Kwong, K., Vangel, M., Chua, E., Gollub, R., 2005. The neural substrate of arithmetic operations and procedure complexity. Cogn. Brain Res. 22 (3), 397–405. Lacouture, Y., Cousineau, D., 2008. How to use MATLAB to fit the ex-Gaussian and other probability functions to a distribution of response times. Tutor. Quant. Methods Psychol. 4 (1), 35–45. Lakoff, G., Núñez, R.E., 2000. Where Mathematics Comes From: How the Embodied Mind Brings Mathematics Into Being. Basic Books, New York. Lee, K.M., Kang, S.Y., 2002. Arithmetic operation and working memory: differential suppression in dual tasks. Cognition 83 (3), B63–B68. LeFevre, J.A., Bisanz, J., Daley, K.E., Buffone, L., Greenham, S.L., Sadesky, G.S., 1996a. Multiple routes to solution of single-digit multiplication problems. J. Exp. Psychol. Gen. 125 (3), 284–306. LeFevre, J.A., Sadesky, G.S., Bisanz, J., 1996b. Selection of procedures in mental addition: Reassessing the problem size effect in adults. J. Exp. Psychol. Learn. Mem. Cogn. 22 (1), 216–230. LeFevre, J.A., DeStefano, D., Penner-Wilger, M., Daley, K.E., 2006. Selection of procedures in mental subtraction. Can. J. Exp. Psychol.-Rev. Can. Psychol. Exp. 60 (3), 209–220. Luria, A.R., 1966. The Higher Cortical Functions in Man. Basic Books, New York. Matzke, D., Wagenmakers, E.J., 2009. Psychological interpretation of the ex-Gaussian and shifted Wald parameters: a diffusion model analysis. Psychon. Bull. Rev. 16 (5), 798–817. McCloskey, M., Caramazza, A., Basili, A., 1985. Cognitive mechanisms in number processing and calculation: evidence from dyscalculia. Brain Cogn. 4 (2), 171–196. Menon, V., Rivera, S.M., White, C.D., Glover, G.H., Reiss, A.L., 2000. Dissociating prefrontal and parietal cortex activation during arithmetic processing. NeuroImage 12 (4), 357–365. Oldfield, R.C., 1971. Assessment and analysis of handedness — Edinburgh inventory. Neuropsychologia 9 (1), 97–113. Penner-Wilger, M., Leth-Steensen, C., LeFevre, J.A., 2002. Decomposing the problem-size effect: a comparison of response time distributions across cultures. Mem. Cogn. 30 (7), 1160–1167. Pinhas, M., Fischer, M.H., 2008. Mental movements without magnitude? A study of spatial biases in symbolic arithmetic. Cognition 109 (3), 408–415. Prado, J., Mutreja, R., Zhang, H., Mehta, R., Desroches, A.S., Minas, J.E., et al., 2011. Distinct representations of subtraction and multiplication in the neural systems for numerosity and language. Hum. Brain Mapp. 32 (11), 1932–1947. Rosenberg-Lee, M., Chang, T.T., Young, C.B., Wu, S., Menon, V., 2011. Functional dissociations between four basic arithmetic operations in the human posterior parietal cortex: a cytoarchitectonic mapping study. Neuropsychologia 49 (9), 2592–2608. Rowe, J.B., Toni, I., Josephs, O., Frackowiak, R.S.J., Passingham, R.E., 2000. The prefrontal cortex: response selection or maintenance within working memory? Science 288 (5471), 1656–1660. Sadato, N., Campbell, G., Ibanez, V., Deiber, M.P., Hallett, M., 1996. Complexity affects regional cerebral blood flow change during sequential finger movements. J. Neurosci. 16 (8), 2693–2700. Schubotz, R.I., von Cramon, D.Y., 2002. Predicting perceptual events activates corresponding motor schemes in lateral premotor cortex: an fMRI study. NeuroImage 15 (4), 787–796. Schubotz, R.I., von Cramon, D.Y., 2003. Functional–anatomical concepts of human premotor cortex: evidence from fMRI and PET studies. NeuroImage 20 (1), 120–131. Shebani, Z., Pulvermüller, F., 2013. Moving the hands and feet specifically impairs working memory for arm- and leg-related action words. Cortex 49 (1), 222–231. Smith, E.E., Jonides, J., 1999. Neuroscience — storage and executive processes in the frontal lobes. Science 283 (5408), 1657–1661. Smith-Chant, B.L., LeFevre, J.A., 2003. Doing as they are told and telling it like it is: selfreports in mental arithmetic. Mem. Cogn. 31 (4), 516–528. Tschentscher, N., Hauk, O., Fischer, M.H., Pulvermüller, F., 2012. You can count on the motor cortex: finger counting habits modulate motor cortex activation evoked by numbers. NeuroImage 59 (4), 3139–3148. van Dijck, J.P., Fias, W., 2011. A working memory account for spatial–numerical associations. Cognition 119 (1), 114–119. Warrington, E.K., 1982. The fractionation of arithmetical skills: a single case study. Q. J. Exp. Psychol. A 34 (1), 31–51. Woolgar, A., Parr, A., Cusack, R., Thompson, R., Nimmo-Smith, I., Torralva, T., et al., 2010. Fluid intelligence loss linked to restricted regions of damage within frontal and parietal cortex. Proc. Natl. Acad. Sci. U. S. A. 107 (33), 14899–14902. Zago, L., Tzourio-Mazoyer, N., 2002. Distinguishing visuospatial working memory and complex mental calculation areas within the parietal lobes. Neurosci. Lett. 331 (1), 45–49. Zhou, X.L., Chen, C.S., Dong, Q., Zhang, H.C., Zhou, R.L., Zhao, H., et al., 2006a. Event-related potentials of single-digit addition, subtraction, and multiplication. Neuropsychologia 44 (12), 2500–2507. Zhou, X.L., Chen, C.S., Zang, Y.F., Dong, Q., Chen, C.H., Qiao, S.B., et al., 2006b. Dissociated brain organization for single-digit addition and multiplication. NeuroImage 35 (2), 871–880.

When problem size matters: differential effects of brain stimulation on arithmetic problem solving and neural oscillations.

Strategies to enhance problem solving.

Fractionating the neural correlates of individual working memory components underlying arithmetic problem solving skills in children.

How To…:Construct Problem-Solving MCQs.

How effective are common ENT operations?

Individual differences in children's innovative problem-solving are not predicted by divergent thinking or executive functions.

Medical problem-solving: an exploration of strategies.

Neural correlates of mathematical problem solving.

A problem-solving routine for improving hospital operations.

Using a general problem-solving strategy to promote transfer.

Differences in Arithmetic Performance between Chinese and German Children Are Accompanied by Differences in Processing of Symbolic Numerical Magnitude.

A problem-solving model for teaching remedial arithmetic to handicapped young children.

How Can Vaccines Contribute to Solving the Antimicrobial Resistance Problem?

Differences in arithmetic performance between Chinese and German adults are accompanied by differences in processing of non-symbolic numerical magnitude.

Are Individual Differences in Arithmetic Fact Retrieval in Children Related to Inhibition?

Weight-loss strategies used by the general population: how are they perceived?

Brain hyper-connectivity and operation-specific deficits during arithmetic problem solving in children with developmental dyscalculia.

Differences in shoot Na+ accumulation between two tomato species are due to differences in ion affinity of HKT1;2.

An integrative view of Luria's perspective on arithmetic problem solving: the two sides of environmental dependency.

Arithmetic and algebraic problem solving and resource allocation: the distinct impact of fluid and numerical intelligence.

Syntactic Awareness and Arithmetic Word Problem Solving in Children With and Without Learning Disabilities.

A recurrent neural network for solving bilevel linear programming problem.

Clinical problem-solving: how sure is sure enough?

'If we want things to stay as they are, things will have to change'*.