Semantic analysis of the user queries in the Croatian Historical Newspapers Portal log file
Loading...
Date
Authors
Journal Title
Journal ISSN
Volume Title
Publisher
Abstract
A historical newspapers digital library contains the basic elements and relations of a digital library, but also some special features and functions necessary for processing its content - huge amount of digitised pages with complex granularity and large amount of (historical) text. Descriptive metadata of newspapers are usually available on the title level only and the full-text search depends on the accuracy of OCR as well as orthographic and semantic issues of historical text. The characteristics of the historical newspapers digital library content pose a challenge for information retrieval and give rise to the following questions: How do users search digitised historical newspapers?, How do they formulate their search queries?, What topics are they looking for?, How do they deal with historical text issues?. Data stored in the historical newspapers digital library search logs can provide some of the answers and help to improve information access. The paper reports on the results of the semantic analysis of the Croatian Historical Newspapers Portal user queries.
Description
Keywords
Citation
Allen, Robert B.; I. Waldstein; W. Zhu. Automated processing of digitized historical newspapers : identification of segments and genres. // International Conference on Asian Digital Libraries, Hanoi, Vietnam, 2008. Pp. 380-387. Available at: http://boballen.info/PAPERS/NewsGenres.pdf
Allen, Robert B.; John Schallow. Metadata and data structure for the historical newspapers digital library. // Proceedings of the 8th international conference on Information and knowledge management CIKM 99. Available at: http://boballen.info/PAPERS/META/meta.pdf
Allen, Robert B. Improving access to digitized historical newspapers with text mining, coordinated models, and formative user interface design. // IFLA Newspaper Section Meeting, New Delhi, February 2010. Available at: http://boballen.info/RBA/PAPERS/IFLA2010/iflaDelhi.pdf
Ahonen, Eeva; Eero Hyvönen. Publishing historical texts on the semantic web: a case study. Available at: http://www.seco.tkk.fi/publications/2009/ahonen-hyvonen-historical-texts-2009.pdf
Bates, Marcia; Deborah N. Wilde; Susan Siegfried. An analysis of search terminology used by humanity scholars: The Getty Online Searching Project Report Number 1. // Library Quarterly 61,1(1993), 61-82.
Broder, Andrei. A taxonomy of web search. // SIGIR Forum. 36, 2(2002). Available at: http://www.cis.upenn.edu/~nenkova/Courses/cis430/p3-broder.pdf
Ćosić, Stjepan. Hrvatska traži povrat 765 fondova i zbirki. // Hrvatsko slovo, March 27, 2009.
Definition of the CIDOC conceptual reference model / editors Nick Crofts et al. ; produced by the ICOM/CIDOC Documentation Standards Group, continued by the CIDOC CRM Special Interest Group. Version 5.0. December 2008.
Available at: http://www.cidoc-crm.org/docs/cidoc_crm_version_5.0_Dec08.pdf
De Jong, Francisca; Henning Rode; Djoerd Himjestra. Temporal Language Models for the Disclosure of Historical Text. Available at: http://eprints.eemcs.utwente.nl/7266/01/db-utwente-433BCEA2.pdf
Europeana : an evaluation of users, usage and information seeking behaviour derived from web-server log-files (October 2009-April 2011). Available at: http://ciber-research.eu/download/20110821-M3.1.2_eConnect_LogAnalysis.pdf
FAST - Faceted Application of Subject Terminology.
Available at: http://oclc.org/research/activities/fast.html
Functional Requirements for Bibliographic Records, Final Report / IFLA Study Group on the Functional Requirements for Bibliographic Records. München : K.G. Saur, 1998. (UBCIM Publications, New Series ; v. 19). Available at: http://www.ifla.org/VII/s13/frbr/frbr.pdf
Gill, Tony. Building semantic bridges between museums, libraries and archives: The CIDOC Conceptual Reference Model. // First Monday: peer reviewed journal on the Internet. 9, 5(May 2004). Available at: http://firstmonday.org/issues/issue9_5/gill/index.html
Han, Hyejung; Wooseob Jeong; Dietmar Wolfram. Log analysis of academic digital library: user query patterns. // iConference 2014 Proceedings. Pp. 1002-1008.
Han, Myung-Ja. Metadata with levels of description: new challenges to catalogers and metadata librarians. // International Federation of Library Associations, World Library and Information Congress, Helsinki, 2012. Available at: http://conference.ifla.org/past-wlic/2012/80-han-en.pdf
Impact project (Improving Access of Text). Available at: http://www.impact-project.eu/
Jansen, B. J.; A. Spink; T. Saracevic. Real life, real users, and real needs: a study and analysis of user queries on the Web. // Information Processing and Management 36(2000), 207−227.
Jansen, Bernard J.; Amanda Spink. How are we searching the world wide web? A comparison of nine search engine transaction logs. // Information Processing and Management 42(2006), 248-263.
Jansen, Bernard J.; Danielle Booth. Classifying Web queries by topic and user intent. CHI 2010: Work-in-Progress, April 14–15, 2010, Atlanta, GA.
Available at: http://faculty.ist.psu.edu/jjansen/academic/jansen_user_intent.pdf
Jansen, Bernard J.; Danielle L. Booth; Amanda Spink. Determining the informational, navigational, and transactional intent of Web queries. // Information Processing and Management: an International Journal archive. 44, 3(2008), 1251-1266.
Jones, Alison. The many uses of newspapers.
Available at: http://dlxs.richmond.edu/d/ddr/docs/papers/usesofnewspapers.pdf
Jones, Steve; Sally Jo Cunningham; Rodger McNab; Stefan Boddie (2000). A transaction log analysis of a digital library. // International Journal on Digital Libraries 3(2000), 152–169.
Klarin Zadravec, Sofija. Koncept funkcionalne granularnosti u organizaciji informacija digitalne knjižnice = A concept of functional granularity in digital library information organisation: doktorski rad. Zagreb : Sveučilište u Zagrebu, Filozofski fakultet, 2012.
Petras, Vivien; Ray R. Larson; Michael Buckland. Time period directories: a metadata infrastructure for placing events in temporal and geographic context. // Joint Conference on Digital Libraries - JCDL Workshop, 2006. Available at: http://metadata.sims.berkeley.edu/tpdJCDL06.pdf
Portal Stare hrvatske novine = Croatian Historical Newspapers Portal. Available at: http://dnc.nsk.hr/newspapers/Default.aspx
Spink, Amanda; Dietmar Wolfram; B. J. Jansen; Tefko Saracevic. Searching the Web: the public and their queries. // Journal of the American Society for Information Science and Technology 52, 3(2001), 226-234.
Smolczewska Tona, Agnieszka. Combining web analytics and computational linguistics to enhance access to digital libraries: a case study. // Biblioteki, informacja, ksiąžka: interdiscyplinarne badania i praktyka w XXI wieku, 7(2010), 264-278.
Available at: http://skryba.inib.uj.edu.pl/wydawnictwa/e07/n-tona.pdf
Tibbo, H. R. Abstracts, online searching, and the humanities: an analysis of the structure and content of abstracts of historical discourse. PhD thesis, University of Maryland, 1989.
Zarndt, Frederic; Brian Geiger; Robert Stauffer; Alyssa Pacy; Meredith Palmer; Joanna DiPasquale. Digital collections: if you build them, will they visit? // International Federation of Library Associations, World Library and Information Congress, Singapore, 2013. Available at: http://www.dlconsulting.com/wp-content/uploads/2013/10/2013-IFLA-Satellite-Zarndt-et-al-Marketing-cultural-heritage-digital-collectionsedt1.pdf
Zavalina, Oksana. Collection-level user searches in federated digital resource environment. // Proceedings of the 70th ASIS&T Annual Meeting (Milwaukee WI, Oct. 18-25, 2007). Available at: http://hdl.handle.net/2142/8983