Resumo: | This thesis presents two analyses: first the analysis of computer corpora from undergraduate textbooks to isolate the (American) English language of science and technology they present; secondly an analysis of the English language competence of undergraduates starting their university studies in science and technology. These two analyses are contrasted in order to apply the results to the design of an English language syllabus for first year undergraduates. A frequency and range word list was produced using a large baseline corpus to contrast with the main corpora taken from physics and chemistry textbooks on the students’ bibliographies as a resource for syllabus design. Secondly, four corpora, two main and two sub-corpora produced from the physics and chemistry textbooks on the bibliographies of the undergraduates were analysed using Biber’s (1988) algorithms and functions for variation across speech and writing. The student intake was tested over five years and the results of those tests analysed. It was found that there was considerable variation in the students’ levels of language competence. However, there was a close correlation between the students’ competence and the number of years they had studied English in secondary school. Nevertheless there were students with extremely advanced competence and some with little or no competence in English amongst the undergraduates. Comprehension of scientific texts was generally found to correlate with more advanced competence and more years of study. The frequency and range word list showed the contexts which are appropriate for materials to be used with these students and demonstrated variation from many of the accepted views of the language of science and technology. The computer corpora analyses varied from Biber’s academic prose category. The sub-corpora demonstrated greatest variation which is believed to be as a result of specific cultural and/or literary material in the analogies used in the textbooks. The heavy load of cultural background knowledge which the reader would need in order to work with the textbooks adequately was also found in the exercises the students were supposed to use for practice on the topic presented in the chapter. This and the interpretation of visuals in the textbooks were considered to be two principle factors that needed to be emphasised in a syllabus for first year undergraduates. However, given the time constraints on language teaching for science and technology students, a methodology which would lead to greater student autonomy is suggested using computer corpus-based studies - data- viii driven learning and computer-supported distance communications and learning.
|