Open Access Open Access  Restricted Access Subscription Access


Jing He, Joachim Quantz


This paper presents interactive knowledge visualization tools supporting knowledge workers in the process of curating digital content for exhibitions, showrooms, visitor centers or museums. The tools developed in the research project DKT (Digital Curation Technologies), funded by the Federal Ministry of Education and Research (BMBF), use language and knowledge technologies (such as information extraction, image recognition, classification and clustering) to automatically process digital multimedia content and then provide interactive visualizations of the results. The tools are thus not meant to replace knowledge workers but rather to support them and allow them to handle more content in a shorter span of time while maintaining or even increasing the quality of the curation process. Given this particular application scenario, the performance and accuracy of current state-of-the-art algorithms from Artificial Intelligence, though far from being perfect, is already good enough. The focus of the project work presented in this paper is on information extraction and text content.


Digital Curation; Knowledge Visualization; Information Extraction; Usability; User Experience; Knowledge Workers.

Full Text:



J. Piskorski, R. Yangarber, “Information extraction: past, present and future,” in T. Poibeau et al. (eds.), Multi-source, Multilingual Information Extraction and Summarization 11, Springer-Verlag Berlin Heidelberg, 2013, pp. 23-49.

TensorFlow: Image Recognition, Tutorial,

L. Pierson, Classification Algorithms Used in Data Science,

G. Seif, The 5 Clustering Algorithms Data Scientists Need to Know,

G. Rehm, J. He, J. Moreno-Schneider, J. Nehring, J. Quantz, “Designing user interfaces for curation technologies,” in 19th International Conference on Human-Computer Interaction – HCI International 2017. Vancouver, Canada, July, 2017.

J. He, N. Göhlsdorf, “Digital Curation Tool in the Age of Semantic Technology,” in C. Busch, C. Kassung, J. Sieck (Eds.), Kultur und Informatik: Mixed Reality 2017, pp. 193–208.

P. Bourgonje, J. Moreno-Schneider, G. Rehm, F. Sasaki, “Processing document collections to automatically extract linked data: semantic storytelling technologies for smart curation workflows,” in A. Gangemi, C. Gardent, (Eds.), Proceedings of the 2nd International Workshop on Natural Language Generation and the Semantic Web (WebNLG 2016), pp. 13–16.

Anjali et al., “Ambiguities in Natural Language Processing,” in International Journal of Innovative Research in Computer and Communication Engineering, Vol. 2, Special Issue 4, pp. 392-394, 2014.

Poesio et al., Anaphora Resolution: Algorithms, Resources, and Applications, Springer, 2016.

Q. Le, T. Mikolov, “Distributed Repre¬sentations of Sentences and Documents,”

R. S. Wills, “Google’s PageRank: The Math Behind the Search Engine,” May, 2006,

B. Perozzi, R. Al-Rfou, S. Skiena, “DeepWalk: Online Learning of Social Representations,” June 2014,

D. Bertsimas, J. Tsitsiklis, “Simulated Annealing,” Statistical Science, Vol. 8, Issue 1, pp. 10-15, 1993,

G. Lakoff, Women, Fire and Dangerous Things, University of Chicago Press, 1990.

A. Katifori et al. “Ontology visualization methods – a survey,”

J. F. Sowa, “Semantic Networks,”

Google: The Knowledge Graph,

D3.js: Data-Driven Documents,


  • There are currently no refbacks.