1.4. Diachronic corpora of English
Table 1.1. Periods covered in the Helsinki Diachronic Corpus*

Period                  Subperiod            Words      Percent
Old English             I.   –850            2,190        0.5
                        II.  850–950        92,050       22.3
                        III. 950–1050      251,630       60.9
                        IV.  1050–1150      67,380       16.3
                        total              413,250      100.0
Middle English          I.   1150–1250     113,010       18.6
                        II.  1250–1350      97,480       16.0
                        III. 1350–1420     184,230       30.3
                        IV.  1420–1500     213,850       35.1
                        total              608,570      100.0
Early Modern English    I.   1500–1570     190,160       34.5
                        II.  1570–1640     189,800       34.5
                        III. 1640–1710     171,040       31.0
                        total              551,000      100.0

Overall
Old English                                413,250     (26.27%)
Middle English                             608,570     (38.70%)
Early Modern English                       551,000     (35.03%)
Total                                    1,572,820     100.00%

*Source: http://www.lancs.ac.uk/staff/xiaoz/papers/corpus%20survey.htm#–Toc92298896.
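The period totals and percentage shares in Table 1.1 can be rechecked from the raw word counts. The following is a minimal sketch in Python; the names HELSINKI_COUNTS, period_totals and shares are illustrative and not part of the corpus documentation. The printed percentages appear to have been rounded so that each column adds up to 100, so a recomputed share may differ from the table in its last digit.

```python
# Word counts per subperiod, copied from Table 1.1.
# The dictionary layout and all names below are illustrative assumptions.
HELSINKI_COUNTS = {
    "Old English": {"I": 2_190, "II": 92_050, "III": 251_630, "IV": 67_380},
    "Middle English": {"I": 113_010, "II": 97_480, "III": 184_230, "IV": 213_850},
    "Early Modern English": {"I": 190_160, "II": 189_800, "III": 171_040},
}


def period_totals(counts):
    """Sum the subperiod word counts for each period."""
    return {period: sum(sub.values()) for period, sub in counts.items()}


def shares(counts):
    """Return each subperiod's share of its own period, in percent."""
    totals = period_totals(counts)
    return {
        period: {name: 100 * n / totals[period] for name, n in sub.items()}
        for period, sub in counts.items()
    }


totals = period_totals(HELSINKI_COUNTS)
grand_total = sum(totals.values())

for period, sub in shares(HELSINKI_COUNTS).items():
    for name, pct in sub.items():
        print(f"{period} {name}: {pct:.1f}%")
    overall = 100 * totals[period] / grand_total
    print(f"{period} total: {totals[period]:,} ({overall:.2f}% of the corpus)")
print(f"Corpus total: {grand_total:,}")
```

Keeping the counts in a nested dictionary mirrors the period and subperiod layout of the table, so the same two functions would work unchanged for any corpus described in this way.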
Kytö (1996) notes that, generally speaking, the criteria for including a text
in the diachronic part of the corpus reflect principles of analysis that take
socio-historical variation into account. The texts were selected so that the
language written in each period would be covered representatively. Periodisation
was of primary importance in the construction of the corpus, but geographical
dialect, type and register of writing, and sociolinguistic variation were also
important criteria.
The Dictionary of Old English Corpus in Electronic Form (DOEC)
Kohonen (2007) observes that the Dictionary of Old English Corpus (DOEC) is an
impressive database containing practically all surviving Old English writings
and comprising 3.5 million words. The DOEC, first released in 1981, is a
pioneering work in the application of technology to lexicography. As Kytö
(2010: 37) notes, it is “an early example of a period-specific corpus
(c. 600–1150), which comprises all extant Old English texts (3,060 of them).
[…] This 3.5-million-word corpus is a rare example of a particularly exhaustive
record of the textual basis preserved from a past period. It is very seldom
possible for a corpus to claim exhaustive coverage owing to the wealth and
length of texts that have come down to us from mediaeval times onwards.” The
project has been carried out under the direction of editors Angus