The Automation of the content filling for JDS system

7 Jul 2016, 16:00
15m
406A

406A

Sectional reports 4. Scientific, industry and business applications in distributed computing systems 10. Databases, Distributed Storage systems, Big data Analytics

Speaker

Ms Tatiana Zaikina (JINR, LIT)

Description

The JINR Document Server (JDS – jds.jinr.ru), has been launched and developed in the framework of Open Access Initiative. Open Access (OA) in science is a way to collect and preserve the intellectual output of scientific organization and disseminate it all over the world. JDS possesses a digital library functionality which is provided by the software Invenio. It covers all the aspects of the modern digital library management. JDS includes collections of video lectures for young scientists, posters, audio lectures and news about JINR. In the future JDS is considered as part of the JINR corporate information system. The filling of content for such a big repository is a hard work that takes a lot of time. The goal of our activity is the maximal automation of the information system filling. As part of the work for the automation of filling the content, two applications have been developed: • The application for the Grants collection for authority records, that allows one to gather information from the official JINR website. The application is based on the Web Scraping technology (technology of data from web pages) for a subsequent download at the JINR DOCUMENT SERVER Information System (JDS). Work is made on the base of jds-test3 testing server. • The application for the Preprints collection for bibliographic records. The methods of automating and collecting information, current and planned functionality of the system are presented in the report.

Primary authors

Presentation materials