KDD Laboratory
Goals
In the Knowledge Discovery in Databases (KDD) Laboratory, we focus on analysis of single documents (e.g. metadata extraction, classification code assignment) and the entire collections (e.g. author name disambiguation, bibliometrics). Typically, we employ state-of-the-art machine learning techniques to address our research problems and verify our approaches on vast collections of documents handled by CeON (over 20 million metadata records, over 7.5 million full texts).
Our interest in the above topics is driven by our involvement in national and European R&D projects, as well as by our internal needs (desired features of solutions created by the software development team at CeON).
The expected output of the research activities is threefold:
- Publications — innovative results of our efforts are presented in relevant conferences and journals (see list of our publications).
- Prototypes — software prototypes are released, on open-source licenses, to the general public and are integrated into products by the software development team at CeON (see our solutions and our code repository).
- APIs — machine access, through web services, to our algorithms and metadata will be provided.
Projects
The research unit at CeON is currently participating in four large-scale projects.
EuDML: The European Digital Mathematics Library aims at delivering an innovative framework for access and exploitation of Europe's rich heritage of mathematics. The project is partially funded by the Competitiveness and Innovation Framework Programme of the European Commission. Our team is involved with work packages WP7 and WP8 of this project.
OpenAIRE+: A continuation of the Open Access Infrastructure for Research in Europe (OpenAIRE), with a goal to develop an open access, participatory infrastructure for scientific information, funded by the Seventh Framework Programme of the European Commission. We are the leader of work package WP7 of the project.
POLON: A project commissioned by the Polish Ministry of Science and Higher Education, partially funded by the Operational Programme Human Capital. The goal is to create an information system for higher education. Our team is involved with subtask 25.1 of the project.
SYNAT: System Nauki i Techniki aims at creating an open hosting and communication platform for science, education and open knowledge society. It is a scientific project commissioned by the Polish National Centre for Research and Development (NCBiR). Our team is involved with the A3 phase of this project.
People
Our team consists of MSc students, PhD students and post-docs in computer science and mathematics. Members:
- Dr. Łukasz Bolikowski
- Artur Czeczko
- Piotr Jan Dendek
- Tomasz Kuśmierczyk
- Michał Łukasik
- Michał Siemiończyk
- Dominika Tkaczyk
We are constantly expanding and looking for talented researchers.