Data mining for scholarly journals: challenges and solutions for libraries

Abstract

As the global knowledge environment changes and the content of scholarly journals becomes increasingly available in digital form, ever more sophisticated search and retrieval procedures are needed to mine it. Large gaps remain in our globally accessible knowledge base because traditional web crawlers cannot collect and assess all of the serially produced papers, articles and journals that exist. Many search engines only scratch the surface and cannot harvest potentially valuable information held in the silos of the “deep web”. More comprehensive data mining is therefore essential if we are to tap the knowledge often hidden in scholarly journals and databases. Data-mining models are being developed that aim to search all of the knowledge being produced worldwide, an essential goal that will aid sharing and thereby accelerate the global diffusion of knowledge. Deep Web Technologies and WorldWideScience.org are examples of ongoing efforts to assist in mining the rapidly growing mass of serially produced scientific information. Knowledge can be shared, advanced and accelerated only if it is accessible. Because users expect libraries to be ever more effective in gathering and using knowledge, libraries must serve the global community by offering the best possible access to, and analysis of, all information. This paper aims to contribute to a more comprehensive understanding of what information is potentially available and how to access and analyze it using the latest methods of information retrieval.

Citation

Arnold, S. (January 30, 2012) Deep Web Technologies: Cracking Multilingual Search. Beyond Search. Retrieved from http://arnoldit.com/wordpress/2012/01/30/deep-web-technologies-cracking-multilingual-search/

Bergman, M.K. (Sept. 21, 2001) The Deep Web: Surfacing Hidden Value. Deep Content. Retrieved from http://brightplanet.com/wp-content/uploads/2012/03/12550176481-deepwebwhitepaper1.pdf

About DeepDyve (2013) http://www.deepdyve.com

Lederman, A. (2010) Breaking Down Language Barriers through Multilingual Federated Search. Information Services & Use. 30, 125-132.

Lee, C-H, & Yang, H-C (2000) Towards Multilingual Information Discovery through a SOM Based Text Mining Approach. PRICAI 2000 Workshop on Text and Web Mining. p. 81.

Oleinik, A. (2012, March 9) Publication Patterns in Russia and the West Compared. Scientometrics. pp. 533-551.

Palmer, D. (2009) The Pacific Rim Library: A Surprising Pearl. Serials Review. Vol. 35, No. 3. pp. 138-141.

Pimienta, D. (2009) Twelve Years of Measuring Linguistic Diversity on the Internet: Balance and Perspective. UNESCO. Global Symposium on Promoting the Multilingual Internet. Retrieved from http://unesdoc.unesco.org/images/0018/001870/187016e.pdf

Prado, D. (2005) Political and Legal Context. Measuring linguistic diversity on the Internet. UNESCO Institute for Statistics, Montreal, Canada – UNESCO.

Preimesberger, C. (2009) Wozniak Joins Another Company, This Time Search Engine DeepDyve. E-week. Retrieved from http://www.eweek.com/c/a/Search-Engines/Wozniak-Joins-Another-Company-This-Time-a-Search-Engine-849454/

Royal Society. (2011) Knowledge, Networks and Nations, Global Scientific Cooperation in the 21st Century. The Royal Society. Retrieved from http://royalsociety.org/uploadedFiles/Royal_Society_Content/policy/publications/2011/4294976134.pdf

Soria, C., Monachini, M., Bertegna, F., Calzolari, N., Huang, C-R., Hsieh, S-K., … Tesconi, M. (February 11, 2009) Exploring Interoperability of Language Resources: the Case of Cross-lingual Semi-automatic Enrichment of Wordnets. Language Resources & Evaluation. 43:87-96.

Wagner, C. & Wong, S. (2012) Unseen Science? Representation of BRICs in Global Science. Scientometrics. 90:1001-1013.

Witt, A. et al. (2009, March) Multilingual Language Resources and Interoperability. Language Resources and Evaluation. Vol. 43, No. 1. pp. 1-14.

Worldwide Science Alliance. (2011) WorldWideScience.org. About page. Retrieved from http://worldwidescience.org/about.html

World Internet Penetration Rates-By Geographic Regions Q2 2012. Retrieved from http://www.internetworldstats.com/stats.htm

Wright, A. (March 9, 2004) In Search of the Deep Web. Salon.com. Retrieved from http://www.salon.com/2004/03/09/deep_web.