viernes, 15 de mayo de 2015

Simple Linear Regression (2012)

Definition


Linear Regression is a data analysis tool that allows us to model a linear relationship between two variables. Linear regression is most frequently used to make predictions based on actual data (Johnson & Christensen, 2012; Kohout, 1974; Hatch & Lazarton, 1991).

History


The first use of linear regression is attributed to Sir Francis Galton, cousin of Charles Darwin, on 1886. Most of his work focused on the relationships between sizes of vegetables, and also correlations between people’s heights (Klugh, 1986).

Assumptions



Linear Regression assumes certain conditions in order to model our data. These assumptions can also help you decide if Linear Regression is a convenient tool for your research. The degree of certainty of our assumptions will make our model more robust. Some fundamental and most frequently used assumptions are the following:

I. Ratio/Interval data in your variables
    • Data you can establish proportions with, and that holds a true zero (Krieg, 2012).
    • E.g. Test scores, age, time, amount of words, etc. 
II. Data are distributed normally
    • Most of the data will be clustered around a central value, the distribution is roughly symmetric, and less data is placed on the extremes.
    • Some tests you can run to verify normality are the Fisher Information Test of Normality, Allan Variance, Entropy methods, etc

III. Variables are correlated
    • One variable depends on the other.
    • If one variable increases, the other decreases or increases correspondingly.
    • Pearson’s r is close to 1 or -1.


Why do we use Linear Regression?

    • Because it is the simplest way to model your data, therefore a good first approach.
    • Because most relationships between variables in Social Sciences are roughly linear.
    • Because linear regression helps to approach research questions that involve relationship between two variables.

How do we use Linear Regression?

I. Decide your independent variable (X)
    • Depending mainly on your background knowledge, hypothesis, and control.
II.Verify data in each variable is normally distributed 
    • Use tests suggested above
III. Plot the data 
    • Place you independent variable on the X-axis and the dependant variable on the Y-axis of the scatter plot.
    • Each point on the scatterplot will have a (Xi, Yi) coordinate.
    • A first glance at the data will help you decide if it is convenient to continue using linear regression or if you need another model (Diamond & Jefferies, 2001).
IV. Calculate correlation coefficient
    • Calculate Pearsons’ r coefficient.
    • Use critical values as a guide for determining the usefulness of the obtained r.
V. Trace the regression line or "best fitting line". 
    • Use least squares method (recommended).
    • This method will provide the parameters a±∆a  and b±∆b, to build the linear equation: ^y= (a±∆a)x + b±∆b. Where a is the rate of change of ^y per 1 unit in x, and b is the value of ^y, when x=0
    • On the scatterplot, a is the slope of the line, and b is the point where the line goes through the y-axis. 

VI. Determine the error of the regression line.
    • Calculate the vertical difference between your actual data and the regression line: the distances are called residuals.
    • Verify the normal distribution of your model. Plot a histogram of the residuals, and check their normality. 
    • Calculate the mean of residuals and check that this value is close to zero. 
    • Calculate the standard deviation of residuals. 


    • Use the regression line equation and error to make accurate predictions of data in the range of the X data set.

Note:  It is recommended to calculate the confidence intervals for your parameters a and b using the “bootstrap method” (Efron & Tibshirani, 1986)

Advantages and Limitations


Pros
    • "A linear relationship is the most elementary form and hence a reasonable first approximation" (Knoke, Bohrnstedt, & Potter Mee, 2002).
    • If data are not visually linear, you can still “rectify” them and use linear regression.
    • Most statistics packages run Linear Regression automatically.
Cons
    • Its robustness is limited to its assumptions.
    • Make sure to account for the "Regression to the mean effect" (Klugh, 1986; Marascuilo & Serlin, 1988) in order to avoid this phenomenon; make sure your research design is suited for linear Regression analysis. 
    • Linear Regression does not determine causality between variables.

Sample Study


Source: Stricker, L. (2004) The performance of native speakers of English and ESL speakers on the computer-based TOEFL and GRE General Test. Language Testing, 21 (2) 146 -173.

Summary 

The study attempts to verify the construct validity of Computer-Based TOEFL. Two studies were articulated, one aiming to see whether native speakers and ESL speakers perform similarly. A second study investigated the relationship between TEOFL scores and GRE scores for both native and ESL speakers. 

Regarding the second study, previous research had found that verbal components of admission tests for adults are found to be highly correlated with ESL test-takers’ TOEFL scores. 

Methods

Participants were adult test-takers of both tests for the first time, all of them pursuing graduate education. Native speakers were recruited from all over U.S., while participants in the ESL group came from different parts of the globe.

Scores were pooled from ETS databases, sampling was performed according to restrictions, such as, having taken both tests in no more than 15 days of difference.

Results

Study 1: Native speakers perform slightly better than ESL speakers, and regarding the maximum possible score. Also the native speaker group showed less variance than the ESL group. 

In study 2: linear regression analysis showed that Computer-based TOEFL scores were moderately or highly correlated with scores on the GRE, and that this relationship was linear for the analytical and quantitative components. However, regression analysis discovered that the relationship between the verbal section of GRE and TOEFL was non linear, in fact, the shape of this relationship showed that even low proficiency level ESL speakers can obtain high scores in the GRE verbal portion, suggesting that the verbal portion of GRE General test, might not be sufficiently making use of verbal components.


Bibliography and Recommended Readings


Diamond, I., & Jefferies, J. (2001) Beginning Statistics: An Introduction for Social Scientists. London: Sage.

Efron, B., & Tibshirani, R. (1986). Bootstrap Methods for Standard errors, Confidence Intervals, and Other Measures of statistical accuracy. Statistical Science , 1 (1), 54-75.

Hatch, E., & Lazarton, A. (1991). The research manual: Design and statistics for applied linguistics. New York: Newbury House Publishers.

Johnson, B., & Christensen, L. (2012). Educational Research: Quantitative, qualitative, and Mixed Approaches. (4th edition ed.). California: Sage.

Klugh, H. E. (1986). Statistics: The essentials for research. Hillsdale, New Jersey: Lawrence Erlbaum Associates.
noke, D., Bohrnstedt, G. W., & Potter Mee, A. (2002). Statistics for Social Data Analysis (4th edition ed.). Itasca, Ill: F. E. 
Peacock Publisher.

Kohout, F. J. (1974). Statistics for Social Scientists: A coordinated learning system. Malabar, FL: Robert E. Krieger .

Krieg, E. J. (2012). Statistics and Data Analysis for Social Sciences. Boston: Pearson Education.

Marascuilo, L. A., & Serlin, R. C. (1988). Statistical Methods for the Social and Behavioral Sciences. New York: W. H. Freeman and Company.

Stricker, L. (2004) The performance of native speakers of English and ESL speakers on the computer-based TOEFL and GRE General Test. Language Testing, 21 (2) 146 -173.

...and the help of my great friend Néstor Espinoza.
















Attention and Noticing in Second Language Learning

A literature Review 
2012



Introduction 

After Schmidt’s Noticing Hypothesis (1986, 1990) claimed that noticing is the necessary and sufficient condition for intake to occur, the debate on noticing and its effects of second language learning has motivated a lot of research and theoretical discussion in SLA. However, nowadays, the discussion does not seem settled and due to the great relevance these concepts have for SLA research, it seems convenient to review and clarify them. 

The purpose of the present literature review is to inform the reader about the debate in SLA regarding the concepts of noticing and acquisition, focusing on the different attempts to operationalize the constructs. 

This paper is organized in the following manner: First it untangles the concepts of attention and noticing from other similar and related ideas such as consciousness and awareness; Second, it reviews positions about the critical question ‘is learning without noticing or attention possible?’, and presents some alternative issues or questions that the debate have risen. And it finally reviews for the some relevant research findings in SLA research on noticing. 

Method 

The present literature review include works up to found in University of Toronto, OISE library catalogue, as well as articles in the databases of: CBCA Education, ERIC, FRANCIS, Linguistics and Language Behavior Abstracts, Proquest Education Journals & Psyinfo. The article search was carried out on the search engine Proquest, using boolean search with a combination of the words “noticing” and/or “attention”, and consciousness and or/ awareness, plus ‘second or foreign language education’, resulting in 145 citations, from which only those regarding studies on noticing and attention and SLE which discuss about the concepts were included, while articles where noticing and attention were only used to draw conclusions in research on other topics, in SLA were excluded, and so were articles related to ADHD or other learning disabilities. 

Untangling noticing and attention 

Several attempts to define the construct of noticing and attention in SLA literature seem to be embedded in a broader discussion about consciousness. From this debate, two trends can be identified: one related to awareness (Schmidt, 1990, 1993, 1994, 1995; Tomlin and Villa, 1994), and the other one regarding the process vs. product dichotomy (VanPatten, 1990, 1994). 

Attention 

Schmidt (1994, 1995) operationalizes the ambiguous construct of consciousness by distinguishing specific processes: consciousness as intentionality, as attention, as awareness and as control. Drawing on psychological assumptions, Schmidt (2001) describes attention as limited in capacity, selective, and partially voluntary; he also portrays it as the door to consciousness, or as a filtering system, facilitating or inhibiting perceived stimuli to be further processed in the mind. 
Moreover, based on information-processing models, Schmidt (1990) claims that short- term memory can be equated with consciousness, and that attention constitutes the link between 
short-term memory and stimuli that is perceived. Put another way, attention carries stimuli into consciousness, to eventually be stored in long-term memory. Similarly, models of automatic/ controlled processing, identify consciousness as control, which always requires attention. In this sense, attention makes learning possible since it allows control processes to become automatic. 
Similarly, VanPatten (1994) also recognizes the role of awareness in attention, but he criticizes the use of psychological operationalization of the construct, since insights from research in psychology do not use real language on their studies, but rather artificial grammar or finite state grammar, which do not represent the complexities of real meaningful language; Also he claims that SLA needs to develop their own knowledge about attention, by studying the construct in learning real languages. 

Also drawing on psychology, but using a more SLA approach, Tomlin and Villa offer a definition of attention by dividing it into three mechanisms, which are the following: Alertness, Orientation and Detection. Alertness is a state of predisposition for receiving external stimuli. It influences the selection of stimuli and it can be manipulated and assessed. More alertness may lead to an increase of rate at which stimuli is perceived, however, too much alertness might affect accuracy of processing, leading to errors. In SLA contexts, Alertness can be considered as a learner's motivation, interest or readiness (p. 297), whereas orientation means a certain direction. It implies that some parts of the stimuli will be attended and others ignored or inhibited. If attention is oriented toward a certain item, that item will be detected. Detection is the cognitive registration in memory of particular segments of stimuli that allows further processing of them. Detecting depletes almost all attentional resources and therefore, detecting one aspect of stimuli will inhibit the detection of other. 

Tomlin & Villa claim that none of these processes require awareness, and that detection can even occur at an unconscious level. They conclude that attention is necessary for awareness, but attention can work without awareness; and that awareness can enhance alertness and orientation but not detection. 

It is important to note in this account, that Tomlin and Villa, though acknowledging the crucial role of these three mechanisms, they do not equate these with learning, nor do they claim that these are sufficient for learning to occur, but rather facilitate further processing of input and its eventual consequent learning. 

Noticing 

It seems that noticing has been defined as the conscious experience of attention, or focal attention, that can therefore, be reported by subjects. In this sense, noticing seems to be attention plus awareness, however, in the untangling of constructs, it seems that different level of awareness produce different concepts. 

In his Noticing Hypothesis, Schmidt (1990, 1993, 1994, 1995) claims that noticing is the necessary and sufficient condition for input to become intake, and defines “noticing” as low level awareness and nearly equates the term with attention, but claims that when we are aware of what is in our focal attention, then we are noticing. Schmidt’s (1990) Noticing is stimuli that goes 
beyond perception, it is stimuli in the focal awareness that is subjectively experienced. When noticing something, it is possible to report it, but failure to report the experience of noticing does not imply the lack of it. Furthermore, If the level of awareness is greater, so we engage in analysis or association with previous experience, then we are in presence of understanding, a different process that requires a higher level of awareness. 

Therefore we can say that Schmidt’s Noticing is an experience of low or medium awareness, since it requires more than pre-attentive mechanisms, but if more awareness is involved, it turns to something different. From my perspective, it seems that noticing is the first step to learning, since it allows us to later engage in higher processing of input. It is also important to note that Schmidt’s noticing is different from Swain’s concept of Noticing, which in the Output Hypothesis, refers to detecting a gap, a lack of knowledge of ability in one’s Interlanguage. In this sense, we can say that Swain’s Noticing is introspective, whereas, Schmidt’s noticing refers to noticing something in input or stimuli, a more external idea of the concept.

On the other trend, VanPatten (1994) clarifies that consciousness must be understood from the perspectives of process, product, context and purpose, and that it is in the processing of input where attention becomes key. He underscores that attention is limited and does require a certain level of consciousness, and claims that what is really relevant to research in SLA is what learners attend to in the input. This way, the old implicit/explicit knowledge and acquisition/ learning distinctions are better understood by determining whether attention is on meaning or form. 

Overall, there seems to be a mild consensus on the role of attention in learning a second language, and the distinction of different mechanisms implied in the processing of input. However, what remains is the debate on the role of awareness in learning, the need to define an SLA-related construct to attention and noticing, more bound to the complexities of language learning; and the old discussion of whether there can be learning without attention and noticing, which is the topic of the next section. 

Can there be learning without noticing or attention? 

There seems to be a trend in SLA literature towards reformulating this question. However two positions are clearly identified, those who reject the possibility of unconscious learning, and those who consider it plausible. 

Aligning to the first position, Schmidt (1990, 1993, 1994, 1995, 2001) denies the possibility of what he refers to as subliminal learning, based on two kinds of arguments: his own research, and a methodological criticism to research that claims unconscious learning to be possible. 
Regarding his own research (Frota & Schmidt, 1986), Schmidt claims that noticing highly correlates with learning. The study recorded his own experience learning Brazilian Portuguese in Brazil. He kept a diary where he reported the things he noticed about language and was recorded on their conversations with native speakers. Results show that Schmidt’s output did not show all the forms available in class or in the input, but rather exactly those that he reported to have noticed. Schmidt claims that this study provides evidence of the connection between noticing and the emergence of linguistic forms in output. 

Regarding methodological issues in other research, he argues that although methods to measure attention, noticing and intake (e.g. like diaries or think-aloud protocols) provide valuable information about what learners are aware of or have noticed, they overshadow the question of whether unattended stimuli translate in learning. He also claims that other techniques implying computerized measurements such as mouse-clicking and eye-movement tracking, only allow us to know learners' orientation, but not detection. 

Additionally, he rejects evidence in psychology of subliminal learning, i.e. learning below the threshold of awareness. According to him, results from these investigations demonstrates]\ that although perceived, subliminal stimuli does not impact behavior or motivation at a higher level. Also, experiments frequently use previously known items, such as words in participants’ native language, and what the results really show is pre-attentive perception or short-term memory storage, which altogether is not equatable to learning a second or foreign language, because the latter is composed of novel items, in complex systems, that must commit to long- term storage and translate in consequent use. 

Moreover, Schmidt (2001) acknowledges some methods to measure non-conscious registration v.s. conscious noticing like verbal reports or priming. However, he argues most researchers wrongly assume that an inaccurate verbal report equals poor detection, disregarding the possibility that, even when detection occurs, some learners are simply not able to put their experience into words. He also criticizes the inferences researchers make from the priming method, since this method grounds on previously perceived stimuli, and not on learning new items. 

In summary, Schmidt claims that evidence for unconscious learning has been rather assumed than proved, and therefore, he moves away from a zero-point question position into a more spectrum view. He again turns to psychology, which denies learning without attention, and reviews some research (e.g. Curran and Keele, 1993) that suggest that it is the degree of attention which yields degrees of learning, i.e. the more we pay attention, the more we learn. He takes this claim to SLA pointing out other work (Van Patten, 1994) that suggests that different features of language, like grammar, might require greater amounts of attention, while others might need less. 

Similarly, VanPatten (1994) replaces the common dichotomy in research of conscious/ unconscious, and explicit vs. implicit learning, since according to him, this debate focuses on products and does not account for how linguistic features are processed in learners' minds and that most research at that time measured knowledge and equated the type of knowledge to a certain type of attention. In other words, when learners were able to articulate rules (product), it was assumed that they were consciously processing input (process). Furthermore, he claims that in order to research attention, its measurement must always be linked with comprehension, since learners always go for meaning first. And he proposes two main hypothesis regarding the way learners allocate attention when learning features of a language: 
H1: Learners process meaning from input before they process it for form.
H2: In order for learner to process form that is not meaningful, they must be able to process informational or communicative content at no or little cost to attention (p. 32). 


In this sense, VanPatten though aligning with the position of no unconscious learning, also avoids the question of whether learning without attention is possible, and instead, focuses on what learners attend to in the input, and how this process unfolds. 

On the opposite position, Tomlin and Villa’s model of attention claims that detection leads to processing and learning, although detection can happen with or without awareness, this implying that unconscious learning is possible. 

Additionally, Truscott (1998) challenges Schmidt’s Noticing Hypothesis, claiming that its foundations are weak, since evidence in cognitive psychology regarding attention and consciousness is inconclusive. He argues that research on attention to tasks and information only demonstrates that attention is beneficial, but not necessary or even sufficient. He also claims that the Noticing Hypothesis is vague and difficult to interpret and operationalize. He argues that their foundations are more related to the lack of evidence of the possibility of subliminal learning, than on unequivocal evidence of the necessity of awareness in learning. Furthermore, he criticizes that the Noticing Hypothesis does not ground on a clear singular view of language and, therefore it is incapable of making predictions. For example, he claims that the Noticing Hypothesis does not account for which specific aspects of grammar or lexis must be noticed in input, and adds that most of the examples provided by Schmidt (1995) seem to be more clearly explained through his own concept of understanding, rather than noticing. 

Although thought-provoking, Truscott’s argument seem to be directed only to the psychological evidence Schmidt uses to formulate the Noticing Hypothesis, and he disregards that the main source of evidence he used was his own study (Schmidt, 1986) and the correspondence of attention, awareness and learning. 

Evidence of noticing and its relationship to learning in SLA 

Evidence from research regarding Noticing and SLA that was included in this review can be classified in three groups: evidence about what and how to prompt noticing, evidence about how noticing and attention work during input processing, and evidence about the relationship between noticing and L2 learning. 

Prompting Noticing 

In a classroom-based quasi-experimental study, Bao et al. (2011) found that certain types of recasts, like rising intonation, can benefit noticing. In their study with adult ESL learners, two main methods were used to measure noticing, stimulated recall and uptake measure, which consisted in registering a learner’s response to recast, either on self-repair or a failed attempt to do so. 
One specific finding relevant to the Noticing debate is that stimulated recall was much better able to elicit instances of noticing, than the uptake measurement, suggesting that, verbal reports, as account of the subject experience of noticing are still valid in research. 

How noticing works 

VanPatten (1990) claims that attention to meaning and form in the input compete. In this sense, if learners are required to attend to form, overall comprehension will decrease if simultaneous attention is required to both aspects of language in the input. He hypothesizes that since attention is limited in capacity, it is only when learners automatize meaning comprehension they are able to focus their attention on formal aspects of input. i.e. higher levels of second language proficiency should allow learners to notice more form in the input. 
In his experimental study with university level students of Spanish as a foreign language, he required participants to listen to a monolog in Spanish and check every time they heard a certain target form. Control group was required to listen only for content, and after the treatment all participants’s comprehension was tested by a free-recall written test, which was scored according to the quantity of ideas learners were able to recall. 
Results show that VanPatten hypotheses were correct, since ANOVA displayed that comprehension was negatively affected by attention to form, but that this problem is less serious if learners’ proficiency is higher. Regarding language features, those related to morphology seem to be more disruptive in processing meaning, than are lexical items. 
However, although this study provides insights about the way attention is allocated during input processing, it does not directly account for the crucial relationship between noticing and learning, a topic which is explored in the section below. 

Noticing and L2 learning 

In a quasi experimental study (Mackey, 2006), interactional feedback was found to prompt noticing and lead to L2 development. Interestingly, the aspect most prone to development after noticing was in this case, questions. Thus providing some insight on the role of noticing in learning formal aspects of an L2. 
Mackey based on Tomlin and Villa’s criticism about the coarse nature of noticing measurement and developed a triangulation method to explore the role of noticing in L2 learning. The study was carried out with 28 ESL university students in U.S., all in the higher- intermediate level. Participants were divided into two groups with a one teacher each, and the target forms were questions, plurals and past tense. 

The treatment consisted in TBL activities set up as games about american T.V. were participants asked and answered questions to each other. In the experimental group the teacher offered interactional feedback, while in the control group the teacher did not. 
Noticing, in this study, was operationalized as “a learner’s report indicating a mismatch between the target language form and the learner’s non-targetlike production or comprehension” (p. 413) therefore following Swain’s idea of “noticing the gap” in the Output Hypothesis. And it was measured in 4 ways: journals, oral stimulated record using the videotaped lessons, written responses about classroom activities in their L1, and written responses on an L2 questionnaire. A pre test and post test elicited the target structures in descriptions of events where contexts and clues were given. Answers were coded according to participants’ level of awareness, assuming that a poor report of noticing did not mean lack of noticing. 

Mackey concluded that Interactional feedback does prompt noticing of forms in the input, and furthermore, participants that reported noticing forms in the input, were found to use those forms in the immediate post test, thus lending evidence that noticing leads to intake. 
Similarly, in a recent pilot study, Godfroid et al. (2010) measure spontaneous noticing (i.e. learners’ processing of input without teachers instructing them to do so) in meaning focused reading and its relationship with pseudo-words uptake. 
Godfroid et al. align with the idea of measuring noticing while embedded in comprehension (VanPatten, 1994) and the further transportation of noticed items to uptake, following Schmidt's ideas. Therefore, target items in the study, i.e. pseudo-words, were embedded in real L1 paragraphs. 

Researchers used a variety of methods: Eye-movement tracker was used to measure noticing, assuming that eye-fixation denotes focal attention; stimulated recall to elicit qualitative data, regarding the subjective experience of participants and a vocabulary discrimination post- test to measure intake. 

Results reveal that, although almost accurately, participants were able to identify the previously encountered words, thus, meaning that what one notices generates at least a short memory trace, leading researchers to conclude that noticing does lead to intake. However, Godfroid et al. caution readers that although eye-fixation and memory trace seem to be linked, there is no warranty that only this amount of attention leads to learning, since we ignore the depth of processing participants engaged with when encountering new words. It is highly possible that deep processing of noticed items is what leads to intake, and not noticing itself. 

Conclusion 

The present paper has reviewed the theoretical debate regarding attention and noticing in the context of SLA and their role in learning. Although it has been more than twenty years since the Noticing Hypothesis was proposed, it seems that consensus has not been reached regarding whether attention is mandatory for learning to occur. However, the debate seem to move towards weaker versions or positions in which noticing is necessary, but not the only requirement for learning (Cross, 2002). 

In light of these scenario, it seems wise to look at results from research on noticing more cautiously, and follow suggestion of moving from questions that deals in absolutes, towards a more spectral vision of phenomena to be applied both in research design and research and interpretation of evidence. 

References 

Bao, M.; Egi, T.; Han, Y. (2011). Classroom study on noticing and recasts features: capturing learner noticing with uptake and stimulated recall. System, 39, pp. 215-228. 

Cross, J. (2002). ‘Noticing’ in SLA: is it a valid concept?. Teaching English as a Second or Foreign Language, 6(3). Retrieved from http://www.tesl-ej.org/wordpress/issues/volume6/ej23/ ej23a2/ 

Curran, T. & Keele, S.W. (1993). Attentional and non-attentional forms of sequence learning. Journal of Experimental Psychology: Learning, Memory, and Cognition, 19, 189-202.

Schmidt, R., & Frota, S. (1986). Developing basic conversational ability in a second language: A case study of an adult learner of Portuguese. In R. R. Day (Ed.), Talking to learn: Conversation in second language acquisition (pp. 237-326). Rowley, MA: Newbury House. 

Godfroid, A., Housen, A. & Boers, F. (2010). A procedure for testing the Noticing Hypothesis in the context of vocabulary acquisition. In Cognitive processing in second language acquisition : Inside the learner's mind. Pütz M., Sicola L. (Eds.), . Philadelphia: John Benjamins Pub. 

Mackey, A.(2006) Feedback, noticing and instructed second language learning. Applied linguistics, 27(3), pp. 405-430 

Schmidt, R. (1990). The role of consciousness in second language learning. Applied Linguistics, 11, 129-158. 

Schmidt, R. (1993). Awareness and second language acquisition. Annual Review of Applied Linguistics, 13, 206-226. 

Schmidt, R. (1994) Desconstructing consciousness is search of useful definitions for applied linguistics. In Consciousness in second language learning(1994). Hulstijn J. H., Schmidt R. and International Association of Applied Linguistics. (Eds.), . [Amsterdam?]: Association internationale de linguistique appliquée. 

Schmidt, R. (1995) Attention and awareness in foreign language learning. In Schmidt R. (Ed.), . Honolulu, Hawai'i: Second Language Teaching & Curriculum Center

Schmidt, R. (2001). Attention. In Cognition and second language instruction, Peter Robinson (Ed.). Cambridge: Cambridge University Press. 

Swain, M. (1995) Three functions of output in second language learning. In G. Cook & B. Seidlhofer (eds), Principles and practice in the study of language . Oxford: Oxford University Press. 

Tomlin, R. and Villa, V. (1994). Attention in cognitive sciences and second language acquisition. Studies in Second Language Acquisition, 16, 183-203. 

Truscott, J. (1998). Noticing in Second Language Acquisition: A critical review. Second language research, 14, 103-135 

VanPatten, B. (1990) Attending to form and content in the input: An experiment in consciousness. Studies in Second Language Acquisition. 12(3), pp 287-301 


VanPatten, B. (1994) Evaluating the role of consciousness in second language acquisition: terms, linguistic features and research. In Hulstijn J. H., Schmidt R. and International Association of Applied Linguistics. (Eds.), [Amsterdam?]: Association internationale de linguistique appliquée. 

Elaborative processing and Second Language Vocabulary Learning


A lit Review2011

Introduction

It is a common matter of discussion and research, whether there exists a unique or best approach that would facilitate Second Language Learning and memorization of vocabulary. Several methods have transcended history, such as Loci, or the classic Rote Learning, but the effectiveness of these does not seem to be sufficient to convince scholars.

That’s how Craik and Lockhart’s (1973) levels of processing framework, poses a whole new paradigm for research, distinguishing between superficial processes (those related to word form) and deep processes (semantic manipulation) and further arguing that only deep processing leads to effective long-term learning. The levels of processing framework influenced the work of scholars like Schmeck (1977) on the Inventory of Learning Strategies, which served as the foundation for curricular projects and paradigms across the globe, in countries such as Chile (Trufello, et al. 2001).

However, it’s been more than 30 years since the model was proposed, and it seems appropriate to look at the advances or drawbacks that research have found regarding the effectiveness of elaborative processing. This paper reviews the available literature on the topic of Elaborative Processing and its advantages and limitations for Second Language Vocabulary Acquisition.

Methodology of the literature review


General information was sought consulting the search engine of Google scholar for articles related to “elaborative processing” and “second language learning”, resulting in about 10 pages of citations from which only those published in academic journals were included in this review.

In order to look for specific and more rigorous information, OISE library SUMMON, formerly Scholars Portal system, was consulted specifically in the Psyinfo, PsycARTICLES, PsycCRITIQUES, InfoTrac Psychology eCollection, ChildDevelopment and Adolescent Studies and ERIC databases, using Boolean search with combinations of the terms “elaborative processing” and “vocabulary” or “second language education” between the years 1960 and present. The search resulted on 123 citations, mostly psychometric experimental studies, from which only those related to learning and language learning were included.


Origins of Elaborative Processing


Craik and Lockhart’s (1973) levels of processing framework claims that deeper mental processing leads to better learning. In their work, the psychologists scrutinize the multi-store model (Atkinson & Shiffrin, 1968)  and criticize it for not properly accounting for issues such as capacity, coding and forgetting. r. In their view, the storage model seems to be inconsistent across different studies and paradigms regarding the mentioned issues, and thus they review contemporary studies regarding memory, focusing on the process on encoding.

Previous studies (Bobrow & Bower, 1969; Rosenberg & Schiller, 1971) had suggested that retention is higher when learning occurs with an orienting task in which learners need to manipulate meaning over word structure. Additionally, the authors recognize the facilitative role of selective attention and meaningfulness in processing and encoding and claim that at deeper levels of thought, people engage in using previously learned structures to enrich the analysis of the stimulus, leading to better retention.

Although Craik and Lockhart’s work constitutes a framework, it has influenced many studies in the following decades, which have actually found empirical evidence of the role of elaborate processing in L2 learning.

For instance, in 1977 Schmeck developed the Inventory of Learning Processes, which was an instrument that categorized students learning strategies; grounded on the Information Processing theories (Craik & Tulvig, 1975) and Levels of Processing Framework (Craik & Lockhart, 1973), and based on a study who analyzed of school students study tactics, 

Later, in 1991, Schmeck et al. published a revised version of the same questionnaire, after testing their validity against 3 other well-known personality tests. The new version, called ILP-R presents 4 dimensions containing sub-scales with categorize learners study strategies into: Reflective processing, Agentic processing and Methodical study.

In the study, Schmeck expanded Craik and Lockhart’s idea of breadth of processing besides depth, and acknowledged elaborative processing as different from deep processing, in which the former involves the self in the analysis of input. As Schmeck et al. point out:

“Elaborative processing involves self-reference, essentially encoding new information in terms of personal metaphor and personal vocabulary; self-expression is a determination to question authority and resolutely express one's own opinions.” (p.350)

Additionally, the study found out that reflective students, that is,  those who make use of deep, elaborating processing or self expression techniques have better school performance than students whose behaviour matches other scale dimensions. Moreover, the correlations suggested that high self-concept is a fairly good predictor of academic success. Thus, high self-concept and reflective processing are two key factors of academic achievement. Consequently, the authors argue that emotional investment and high self- concept are prerequisites for improving cognitive skills.


Evidence and limitations of Elaborative processing in general L1 learning


Several studies highlight the positive effect of elaborative strategies in achieving better and longer-term learning. However, same studies point out the limitations such as, amount of background knowledge and time constrains or pace.

For example, Willoughby et al. (1994) argue that elaborative strategies are most effective when sufficient amount of background knowledge is available. The researchers investigated the role of prior knowledge in elaborative interrogation strategies (e.g. asking “why” questions about the items) and imagery-based or keyword strategies on the assumption that elaborative interrogation strategies activate schemata allowing learners to connect prior knowledge to new information, which in turn, facilitates retrieval (Anderson, 1990; Bower, Black &Turner, 1979; Fiske, 1984; Thorndyke & Yekovich, 1980).

The experimental design study consisted of three experiments that were carried out with undergraduate students, contrasting different strategies and the role of background knowledge in immediate retention.

The study concluded that elaborative strategies are most effective when sufficient amount of background knowledge is available.

In experiment 1, elaborative and imagery-based conditions outperformed control rote learning; however, the first two resulted in equal performance, when prior knowledge and contextual cues were available. Interestingly, when prior knowledge was not available, e.g. learning new unfamiliar animals, neither of the conditions was better than rote learning, but imagery-based strategies surpassed elaborative interrogation.

In experiment 2, participants with high background knowledge engaged in elaborative interrogation condition outperformed other groups and conditions; however, all conditions were same in the low background knowledge group. The study also showed that the more students used background information, during studying the facts in the elaborative interrogation condition, the better they recalled the facts in the post -test. Thus suggesting the important role of background or prior knowledge in linking new information.

Moreover, keyword strategies were also tested and unsurprisingly outperformed all other strategies in immediate recall. However, these findings may be taken cautiously, since contemporary research had proven that the keyword method is more prone to long-term forgetting than other strategies (Thomas and Wang, 1992; 1996).

In other aspect of elaborative processing research, Benjamin & Bjork’s (2000) work about word recognition focusing on accuracy and speed of retrieval under the conditions of rote and elaborative rehearsal in undergraduate students learning familiar and new words in their first language. In the three experiments that composed such study, elaborative rehearsal led to higher and longer-term recall than rote rehearsal condition, under normal speed. Thus concluding that elaborative rehearsal leads to more accurate word recognition. However, the opposite is true under speeded retrieval, where rote rehearsal outperformed elaborative learning. That is to say, it takes longer to remember something that we learn via elaborative rehearsal.

The study also found that longer time of exposure to conditions, i.e. more study time led to better performance both in rote and elaborative rehearsal under normal speed conditions.

According to the researchers, the fact that items learned with elaborative rehearsal suffer under time pressure, might be due to the longer time it takes for learners to access a memory trace that consist of more associations, as Benjamin and Bjork point out:

Creating a unique, perhaps linked, set of associations does indeed foster more likely retrieval, but mentally traversing those links is a time-consuming (and perhaps resource- consuming) process. (p. 646)

Despite of these findings, the constrains found for elaborative rehearsal might not be significant for all activities or disciplines. Since the difference in recall was about a matter of seconds, quick recall might not be equally important for a firefighter making quick decisions in order to save a life than for a second language learner answering a test in the classroom.

In summary, research on learning items in learners’ first language show that elaborative processing does represent an effective learning strategy, nevertheless it must fulfill certain conditions in order to take advantage of it. 

Evidence and limitations of Elaborative processing in L2 acquisition


Significant evidence of the beneficial effect of Elaborative Processing in second language learning was presented in Coomber et al. (1986) whose study showed that composing learning, that is, sentence elaboration, has a significant effect in vocabulary learning, outperforming other types of learning, such as presenting examples or rehearsal.

The experimental design evaluates the condition of learning with examples against learning through composing, i.e. deep level processing (Emig, 1977), and traditional Rote Rehearsal of definitions in learning new lexical items. The experimental items were a set of 10 artificial words; and the participants, college students, were tested shortly after the treatment.

The study found that sentence composing condition outperforms example-based and rehearse techniques, thus supporting elaborative processing theory. However, There were no significant difference between example-based and rehearse techniques.

Coomber’s study is one of the few that refers to “learning” instead of mere “memory” and highlights that elaborative learning is active learning and involves learner’s previous knowledge and decisions (Di Vesta and Peverly, 1984). This is particularly important since it resonates with Willoughby and Craik & Lockhart’s idea of using familiar or prior knowledge to analyze new information.

However, although the authors highlight the role of previous knowledge, this was not measured in the study, since the lexical items used in the study were artificial, thus, it is very difficult to determine the real depth of processing in which participants engaged. At the same time, Coomber et al. failed in providing any information regarding the effect of elaborate processing in long-term retention.

In a later study, Fraser (1999) found that Incidental L2 vocabulary learning is significantly enhanced when learners are formally instructed in the use of learning processing strategies such as, inferring or consulting. Using a quasi experimental design and the Think aloud protocol, Fraser studied ESL francophone university-level participants’ use of Lexical processing strategies when encountering a new word and compared their behavior before and after receiving formal training in the use of lexical processing strategies (LPS). 

The results showed that the most used LPS is inferring. Participants tended to infer more than ignore or consult new words; and tended to ignore less new words after formal LPS instruction, which might suggest a developing lexical awareness mediated by intentional learning. However, the strategy of consulting was more effective (i.e. participants obtained the correct meaning of the word) than inferring, and both strategies improved significantly after LPS training.

Additionally, sense creation inference, i.e. context driven guess, was most frequently used and effective than lexeme activation inference, i.e. a word form activates an L1 or L2 association, like cognates; which gives us a clue of the facilitative role of context in guessing new words meaning. Moreover, Longer retention was achieved when new words were partially familiar and when combination of LPS (inferring and consulting) was used. 

Fraser (1999) also mentions the controversial role of the think aloud protocol, which might have acted as a mean of learning involvement (Ellis, 1994; Pressley & Afflerbach, 1995). This is particularly interesting if we consider a sociocultural perspective to understand levels of processing, since authors such as Vygotsky (1986) and Swain et al. (2011) recognize the role of “Languaging” as a strategy of gaining conscious control over mental processes.

Consequently, Fraser interprets these finding as evidence of the incremental nature of vocabulary and the effectiveness of elaborate processing in L2 learning, since it engages deep semantic processing and prior knowledge, even in long term.

On that same token, and in a study that resonates with Willoughby’s et al. (1994) on the role of background knowledge, Brancroft (2002) warns us to use elaborative processing primarily with partially known words, since simultaneous processing of new items’ form and meaning would exhaust learners, hindering learning.
Based on his own previous research (2000), which found that sentence composing as an elaborative strategy interferes the learning of new words’ form, Barcroft is skeptical of the positive effects of elaborative processing, and criticizes the way semantic processing has been operationalized in studies advocating the construct. 

In his 2002 research Barcroft uses the TOPRA model to verify and explain why semantic processing inhibits structural processing, thus leading to no recall of new word form. The research consist of an experimental design of 3 groups of university students participants exposed to 3 different conditions: semantic, where participants learn real Spanish words by manipulating meaning; structural and control, where they focus on form; and no processing, where participants were asked to learn a list a words the best way possible, in other words, rote rehearsal.

The research showed that structural condition had a significant effect in learning new words, whereas semantic condition had most significant impact in learning partially known L2 words. Nevertheless, participants who engaged in the semantic condition had great difficulties recalling new words form, thus verifying the inhibitory effect of semantic elaboration on learning new items.

Although Barcroft’s rigorous study provides important considerations about the best time to using elaborate processing, his study fails in giving insight on the long-term effect of the TOPRA model and its predictions regarding L2 vocabulary learning.

Teaching implications regarding Elaborative Processing


As it is shown in the literature review above, and very much like any other teaching strategy, using elaborative processing to teach second language vocabulary seem potentially beneficial and effective in the long term, although at the same time complex and full of consideration that must carefully taken into account. In the following section, I will collect all the insights we gained from the studies on Elaborative Processing and review the teaching implications these pose for different aspects of the teaching labour.


Activities and tasks


It is key to set clear goals in vocabulary teaching in order to choose the appropriate focus (semantic or structural) and choose the corresponding assessment.

Educators must keep in mind that learners have a limited capacity of processing during tasks (Barcroft, 2002), therefore, tasks must be carefully chosen. It is important to pay attention to whether tasks are relevant and appropriate to our goal or focus. As Barcroft argues, teacher must let learners focus their attention of lexemes during the first encounters with a new word, in order to avoid processing exhaust. Activities and tasks related to word form are most appropriate for learning new items. On further encounters with the same lexical item, more semantic level processing is beneficial. For example, sentence composing provides learners with more opportunities for deep level processing, and therefore more effective (short-term) learning (Coomber, 1986).


Context and prior knowledge


Since elaborative processing is very dependent on background or prior knowledge (Barcroft, 2002; Willoughby et al., 1994), context is key in activating schemata and thus facilitating elaborative processing. In semantic level processing, learners must engage in lexical item analysis that require the use of previous experiences and knowledge; therefore, learning tasks should make learners build connections and make use of all their knowledge and experiences. However, in those situations where prior knowledge is limited, strategies such as rote learning and imagery-based ones might result more effective in first encounters with lexical items or beginner students with an incipient lexicon.


Time to learn and assess


When using elaborative processing, teachers must allocate time to learn. Since, as explained by Benjamin and Bjork (2000) elaborative processing is very time consuming and mentally demanding.

Additionally, teachers must be aware that learners might need more time to retrieve items as well. This means, for example, that if we test something they learned through elaborative rehearsal, they might need more time to come up with the correct answer. Therefore, it is important to allocate thinking time when testing.


Enhancing long-term learning


Due to the incremental nature of vocabulary and in order to move from lexeme level processing to lemma or semantic level processing, teachers need to constantly recycle lexical items. This way, in first encounters with a word, learners develop the core knowledge and in further encounters eventually they will develop more encyclopaedic knowledge, moving in size and depth of vocabulary learning (Schmitt, 2000).


The role of creativity


Following Schmeck’s (1991) suggestions, educators should encourage learners to appropriate information; in other words, involving themselves in the process of learning new items. One way to achieve this might be to have learners compose texts in which they merge their own experiences or expectations with new information, for instance, by composing sentences related to themselves using the new vocabulary, or restating new information with their own words, as in paraphrasing.


Complementing Instruction-based Learning and Incidental Learning


As teachers we are not always able to expose our learners to the ideal amount of vocabulary, nor to provide all the necessary encounters with a word. This is why it is key to complement formal instruction with incidental learning. 

As Fraser (1999) points out, teaching learners some strategies that might help them become more autonomous might open a huge door to learning from the environment. For example, in Foreign Language Learning, where everyday context does not provide enough input, teaching LPS explicitly might enhance incidental learning through reading authentic material or graded texts for pleasure.

Finally, It depends on educators to build up a extensive repertoire of strategies to be able to better address different learners, with different styles, preferences, capacities, background knowledge and experiences.


Conclusion


This paper has reviewed and briefly evaluated  the available literature regarding evidence and limitations of Elaborative Processing in Second Language Vocabulary Learning.

It is clear that the use of Elaborative Processing in Teaching Second Language vocabulary does report benefits, although it is also clear that educators need to be cautious as to when and how to incorporate this strategy in their lessons or teaching material.

Although knowledge about Elaborative Processing Strategies so far might enable educators to use it successfully in teaching, there seems to be more room for further studies that discover the nuances of these strategies and the multiple ways learners can benefit from it.


References


Anderson, J. R. (1990). Cognitive psychology and its implications (3rd ed.). New York: Freeman

Atkinson, R. C. & Shiffrin, R. M. (1968). "Chapter: Human memory: A proposed system and its control processes". In Spence, K.W. & Spence, J.T.. The psychology of learning and motivation (Volume 2). New York: Academic Press. pp. 89–195.

Barcroft, J. (2002). Semantic and structural elaboration in L2 lexical acquisition. Language learning, 52(2), 323 - 363

Benjamin, A. S., & Bjork, R. A. (2000). On the relationship between recognition speed and accuracy for words rehearsed via rote versus elaborative rehearsal. Journal of Experimental Psychology. Learning, Memory, and Cognition, 26(3), 638-648. 

Bobrow, S. & Bower, G. H. (1969). Comprehension and recall of sentences. Journal of Experimental Psychology, 80, 455-461.

Bower, G. H., Black, J. B., & Turner, T. J. (1979). Scripts in memory for text. Cognitive psychology, 11, 177-220.

Coomber, J. E.; Ramstad, D. A. and Sheets, D. R. (1986). Elaboration in Vocabulary Learning: A Comparison of Three Rehearsal Methods. Research in the teaching of English, 20(3), 281 - 293

Craik, F. I. M., & Lockhart, R. S. (1972). Levels of processing: A framework for memory research. Journal of Verbal Learning and Verbal Behavior, 11(6), 671–684.

Craik, F. I. M. & Tulving, E. (1975) Depth of processing and retention of words in episodic memory, Journal of Experimental Psychology: General, 104, pp. 268-294.

DiVesta, F.J., & Peverly, S. T. (1984). The effects of encoding variability, processing activity, and rule-examples sequence on the transfer of conceptual rules. Journal of Educational Psychology, 76, 108-119.

Ellis, N. C. (1994). Consciousness in second language learning: Psychological perspectives on the role of conscious processes in vocabulary acquisition. AILA Review, 11, 37–56.

Emig, J. (1977). Writing as a mode of learning. College Composition and Communication, 28, 122-128.

Fraser, C. A. (1999) Lexical Processing Strategy Use And Vocabulary Learning Through Reading. Studies in Second Language Acquisition, 21, 225-241

Fiske, S. T. (1984). Social cognition. New York: McGraw-Hill. 

Pressley, M., & Afflerbach, P. (1995). Verbal protocols of reading: The nature of constructively responsive reading. Hillsdale, NJ: Erlbaum.

Rosenberg, S. & Schiller, W. J. (1971)  Semantic coding and incidental sentence recall. Journal of Experimental Psychology, 90, 345-346.

Schmeck, R.R., Ribich, F.D. & Ramanaiah, N. (1977) Development of a self-report inventory for assessing individual differences in learning processes, Applied Psychological Measurement, 1, pp. 413-431.

Schmeck, R. R., Geisler-Brenstein, G. &  Cercy, S. P. (1991): Self-Concept and Learning: the revised inventory of learning processes, Educational Psychology, 11:3-4, 343-362

Schmitt, N. (2000) Vocabulary in language teaching. Cambridge: Cambridge University Press

Swain, M., Kinnear, P., Steinman, L. (2011). Sociocultural theory in second language education: an introduction through narratives. Toronto: multilingual matters.

Thomas, M. H. & Wang, A. Y. (1996). Learning by the keyword mnemonic: looking for long-term benefits. Journal of experimental psychology: applied. 2(4), 330-342.

Thomas, M. H. & Wang, A. Y. (1992). The Effect of Imagery-Based Mnemonics on the Long-Term Retention of Chinese Characters. Language learning, 42(3), 359-376.

Thorndyke, P. W., & Yekovich, F. R. (1980). A critique of schema-based theories of human story memory. Poetics, 9, 23-49

Trufello, I., Pérez, F., Aspillaga, E., Catalán, L. (2001) Infoeduca: Contenidos educacionales evaluados, que potencien masivamente el proceso de aprendizaje significativo a través de Internet. presented in Taller Internacional de Sofware Educativo 2001, Chile. Retrived from http://www.c5.cl/ieinvestiga/i_actas.htm

Vygotsky, L. S. (1986). Thought and word. (A. Kozulin, trans.) Cambridge, MA: The MIT press.


Willoughby, T., Wood, E., & Khan, M. (1994). Isolating variables that impact on or detract from the effectiveness of elaboration strategies. Journal of Educational Psychology, 86(2), 279-289.