Treść książki

Przejdź do opcji czytnikaPrzejdź do nawigacjiPrzejdź do informacjiPrzejdź do stopki
HumanitiesandBigData.ExploitingDigitalArchivesintheAgeofAbundance
23
Itisworthnotingthatthemajorityofpastfulltextanalysisofliteraryworks
wereinmostcasesquitelimitedinscope(whiledeliveringinterestingandsi-
gnificantresults,suchwasthecaseofe.g.SimilarDiversityproject,seeFig.2)
mostlyduetodifcultieswithobtainingthedataandwithdifcultieswithpro-
cessingthedatacorpora.Firstproblemiscurrentlymostlymitigatedthanks
toaforementioneddigitizationeńorts;andthesecondonecanbetackledby
exploitingrentedcomputinginfrastructureoutlinedinpreviouschapter.
Fig.2.HolyscripturesvisualizationdoneforSimilarDiversityproject(http://similardiversity.net)
Suchanalysisthatcouldbedonemaybeprobablycomparedtotraditional
datamining(orcurrently“datascience”)eńorts,associatedwithbusinessin-
telligencetrendthatemergedintheindustryinthenineties.Inthisapproach
variousanalyticaltools,mostlyrelatedtomachinelearningfieldwereusedin
ordertodiscovertrendsintransactionaldatathatareimpossibletodetectjust
byobservingindividualtransactionsorobservations2.Similarapproachispo-
ssiblealsoinrelationtodatausedbyhumanists.AnexamplemightbeaPolish
universitiesconnectivityanalysis,carriedoutwithinSYNATresearchproject
ofNCBiR.InthisanalysisthedatabaseofPolishInformationProcessingCen-
trewasanalyzedandvisualized.Tisdatabasecontainsinformationabout
2Transactionsbeinginmostcasesindividualpurchasesofproductsorobservationsofuser
behaviorinonlinesystems.