Scientific Work Using or Referencing ELKI

Over the years, ELKI has been increasingly cited and used in scientific publications and other software projects.

The following list is automatically generated from very heterogeneous sources and does contain errors. Where possible, we use metadata from DBLP, SemanticScholar, and HTML meta headers from the publisher web pages. For theses, seminar articles, etc., however, this approach does not work. We have not verified every citation discovered by the bot.


  1. Abdulrahman H. Altalhi, José María Luna, M. A. Vallejo, and Sebastián Ventura (2017). Evaluation and comparison of open source software suites for data mining and knowledge discovery. Wiley Interdisc. Rew.: Data Mining and Knowledge Discovery 7(3), 10.1002/widm.1204, BibTeX
  2. Adam Byron (2017). Clustering and Network Analysis of Reverse Phase Protein Array Data. Molecular Profiling, 171-191, Springer, 10.1007/978-1-4939-6990-6_12
  3. Arno G. Stefani, Achim Sandmann, Andreas Burkovski, Johannes B. Huber, Heinrich Sticht, and Christophe Jardin (2017). Application of Methods from Information Theory in Protein-Interaction Analysis. Information- and Communication Theory in Molecular Biology, 293-313, Springer, 10.1007/978-3-319-54729-9_13
  4. Charu C. Aggarwal (2017). Applications of Outlier Analysis. Outlier Analysis, 399-422, Springer, 10.1007/978-3-319-47578-3_13
  5. Charu C. Aggarwal (2017). High-Dimensional Outlier Detection: The Subspace Method. Outlier Analysis, 149-184, Springer, 10.1007/978-3-319-47578-3_5
  6. Charu C. Aggarwal, and Saket Sathe (2017). Variance Reduction in Outlier Ensembles. Outlier Ensembles, 75-161, Springer, 10.1007/978-3-319-54765-7_3
  7. Jakub Sawicki, Maciej Smolka, Marcin Los, Robert Schaefer, and Piotr Faliszewski (2017). Two-Phase Strategy Managing Insensitivity in Global Optimization. EvoApplications (1), 266-281, 10.1007/978-3-319-55849-3_18, BibTeX
  8. Christian Beilschmidt, Thomas Fober, Michael Mattig, and Bernhard Seeger (2017). Quality Measures for Visual Point Clustering in Geospatial Mapping. W2GIS, 153-168, 10.1007/978-3-319-55998-8_10, BibTeX
  9. Lediona Nishani, and Marenglen Biba (2017). Randomizing Greedy Ensemble Outlier Detection with GRASP. CISIS, 974-983, Springer, 10.1007/978-3-319-61566-0_92, BibTeX
  10. Giannis Evagorou, and Thomas Heinis (2017). STATS - A Point Access Method for Multidimensional Clusters. DEXA (1), 352-361, Springer, 10.1007/978-3-319-64468-4_27, BibTeX
  11. Ankita Roy, Soumya Ray, and Radha Tamal Goswami (2017). Approaches and Challenges of Big Data Analytics—Study of a Beginner. Proceedings of the First International Conference on Intelligent Computing and Communication, 237-245, Springer, 10.1007/978-981-10-2035-3_25
  12. Hans-Peter Kriegel, Erich Schubert, and Arthur Zimek (2017). The (black) art of runtime evaluation: Are we comparing algorithms or implementations?. Knowl. Inf. Syst. 52(2), 341-378, 10.1007/s10115-016-1004-2, BibTeX
  13. Junming Shao, Xinzuo Wang, Qinli Yang, Claudia Plant, and Christian Böhm (2017). Synchronization-based scalable subspace clustering of high-dimensional data. Knowl. Inf. Syst. 52(1), 83-111, 10.1007/s10115-016-1013-1, BibTeX
  14. Johannes Schneider, and Michail Vlachos (2017). Scalable density-based clustering with quality guarantees using random projections. Data Min. Knowl. Discov. 31(4), 972-1005, 10.1007/s10618-017-0498-x, BibTeX
  15. Klaus Arthur Schmid, Andreas Züfle, Tobias Emrich, Matthias Renz, and Reynold Cheng (2017). Uncertain Voronoi cell computation based on space decomposition. GeoInformatica, Springer, 10.1007/s10707-017-0293-2
  16. Mohamed Ben Khalifa, Rebeca P. Díaz Redondo, Ana Fernández Vilas, and Sandra Servia Rodríguez (2017). Identifying urban crowds using geo-located Social media data: a Twitter experiment in New York City. J. Intell. Inf. Syst. 48(2), 287-308, 10.1007/s10844-016-0411-x, BibTeX
  17. Kai Ming Ting, Takashi Washio, Jonathan R. Wells, and Sunil Aryal (2017). Defying the gravity of learning curve: a characteristic of nearest neighbour anomaly detectors. Machine Learning 106(1), 55-91, 10.1007/s10994-016-5586-4, BibTeX
  18. Seyed Morteza Mousavi, Aaron Harwood, Shanika Karunasekera, and Mojtaba Maghrebi (2017). Geometry of interest (GOI): spatio-temporal destination extraction and partitioning in GPS trajectory data. J. Ambient Intelligence and Humanized Computing 8(3), 419-434, 10.1007/s12652-016-0400-5, BibTeX
  19. Michalis Korakakis, Evaggelos Spyrou, Phivos Mylonas, and Stavros J. Perantonis (2017). Exploiting social media information toward a context-aware recommendation system. Social Network Analysis and Mining 7(1), Springer, 10.1007/s13278-017-0459-9
  20. Emre Güngör, and Ahmet Özmen (2017). Distance and density based clustering algorithm using Gaussian kernel. Expert Syst. Appl. 69, 10-20, 10.1016/j.eswa.2016.10.022, BibTeX
  21. William M. Trochim (2017). Hindsight is 20/20: Reflections on the evolution of concept mapping. Evaluation and Program Planning 60, 176-185, Elsevier BV, 10.1016/j.evalprogplan.2016.08.009
  22. Elyse Allender, and Tomasz F. Stepinski (2017). Automatic, exploratory mineralogical mapping of CRISM imagery using summary product signatures. Icarus 281, 151-161, Elsevier BV, 10.1016/j.icarus.2016.08.022
  23. Sirisup Laohakiat, Suphakant Phimoltares, and Chidchanok Lursinsap (2017). A clustering algorithm for stream data with LDA-based unsupervised localized dimension reduction. Inf. Sci. 381, 104-123, 10.1016/j.ins.2016.11.018, BibTeX
  24. Francesco Gullo, Giovanni Ponti, Andrea Tagarelli, and Sergio Greco (2017). An information-theoretic approach to hierarchical clustering of uncertain data. Inf. Sci. 402, 199-215, 10.1016/j.ins.2017.03.030, BibTeX
  25. Giuseppe Rizzo, Rosa Meo, Ruggero G. Pensa, Giacomo Falcone, and Raphaël Troncy (2017). Shaping City Neighborhoods Leveraging Crowd Sensors. Inf. Syst. 64, 368-378, 10.1016/, BibTeX
  26. Alvin Chiang, Esther David, Yuh-Jye Lee, Guy Leshem, and Yi-Ren Yeh (2017). A study on anomaly detection ensembles. J. Applied Logic 21, 1-13, 10.1016/j.jal.2016.12.002, BibTeX
  27. Dominik Sacha, Michael Sedlmair, Leishi Zhang, John A. Lee, Jaakko Peltonen, Daniel Weiskopf, Stephen C. North, and Daniel A. Keim (2017). What you see is what you can change: Human-centered machine learning by interactive visualization. Neurocomputing, Elsevier BV, 10.1016/j.neucom.2017.01.105
  28. Marcin Los, Jakub Sawicki, Maciej Smolka, and Robert Schaefer (2017). Memetic approach for irremediable ill-conditioned parametric inverse problems. ICCS, 867-876, Elsevier, 10.1016/j.procs.2017.05.007, BibTeX
  29. Hannes Bitto, Beatrice Mörstedt, Sylvia Faschina, and Rolf-Dieter Stieglitz (2017). ADHS bei Erwachsenen. Ein dimensionales oder kategoriales Konstrukt?. Zeitschrift für Psychiatrie, Psychologie und Psychotherapie 65(2), 121-131, Hogrefe Publishing Group, 10.1024/1661-4747/a000311
  30. Ricardo de Souza Jacomini, David Correa Martins Jr., Felipe Leno da Silva, and Anna Helena Reali Costa (2017). GeNICE: A Novel Framework for Gene Network Inference by Clustering, Exhaustive Search, and Multivariate Analysis. Journal of Computational Biology 24(8), 809-830, 10.1089/cmb.2017.0022, BibTeX
  31. Weiyu Huang, and Alejandro Ribeiro (2017). Axiomatic hierarchical clustering given intervals of metric distances. ICASSP, 4227-4231, IEEE, 10.1109/ICASSP.2017.7952953, BibTeX
  32. Yoshiyuki Harada, Yoriyuki Yamagata, Osamu Mizuno, and Eun-Hye Choi (2017). Log-Based Anomaly Detection of CPS Using a Statistical Method. IWESEP, 1-6, IEEE, 10.1109/IWESEP.2017.12, BibTeX
  33. Wesin Alves, Daniel Martins, Ubiratan Bezerra, and Aldebaro Klautau (2017). A Hybrid Approach for Big Data Outlier Detection from Electric Power SCADA System. IEEE Latin America Transactions 15(1), 57-64, IEEE, 10.1109/TLA.2017.7827888
  34. David Ciechanowicz, Dominik Pelzer, Benedikt Bartenschlager, and Alois Knoll (2017). A Modular Power System Planning and Power Flow Simulation Framework for Generating and Evaluating Power Network Models. IEEE Transactions on Power Systems 32(3), 2214-2224, IEEE, 10.1109/TPWRS.2016.2602479
  35. Soongeol Kwon, Lewis Ntaimo, and Natarajan Gautam (2017). Optimal Day-Ahead Power Procurement With Renewable Energy and Demand Response. IEEE Transactions on Power Systems 32(5), 3924-3933, IEEE, 10.1109/TPWRS.2016.2643624
  36. Huawen Liu, Xuelong Li, Jiuyong Li, and Shichao Zhang (2017). Efficient Outlier Detection for High-Dimensional Data. IEEE Transactions on Systems, Man, and Cybernetics: Systems PP(99), 1-11, IEEE, 10.1109/TSMC.2017.2718220
  37. Jifu Zhang, Xiaolong Yu, Yaling Xun, Sulan Zhang, and Xiao Qin (2017). Scalable Mining of Contextual Outliers Using Relevant Subspace. IEEE Transactions on Systems, Man, and Cybernetics: Systems, 1-15, IEEE, 10.1109/TSMC.2017.2718592
  38. Peter Bailis, Edward Gan, Samuel Madden, Deepak Narayanan, Kexin Rong, and Sahaana Suri (2017). MacroBase: Prioritizing Attention in Fast Data. SIGMOD Conference, 541-556, ACM, 10.1145/3035918.3035928, BibTeX
  39. Rocío B. Hubert, Ana G. Maguitman, Carlos I. Chesñevar, and Marcos A. Malamud (2017). CitymisVis: a Tool for the Visual Analysis and Exploration of Citizen Requests and Complaints. Proceedings of the 10th International Conference on Theory and Practice of Electronic Governance - ICEGOV ’17, ACM Press, 10.1145/3047273.3047320
  40. Erich Schubert, Jörg Sander, Martin Ester, Hans-Peter Kriegel, and Xiaowei Xu (2017). DBSCAN Revisited, Revisited: Why and How You Should (Still) Use DBSCAN. ACM Trans. Database Syst. 42(3), 19:1-19:21, 10.1145/3068335, BibTeX
  41. Andrew Lensen, Bing Xue, and Mengjie Zhang (2017). GPGC: genetic programming for automatic clustering using a flexible non-hyper-spherical graph-based approach. GECCO, 449-456, ACM, 10.1145/3071178.3071222, BibTeX
  42. Daniyal Kazempour, Markus Mauder, Peer Kröger, and Thomas Seidl (2017). Detecting Global Hyperparaboloid Correlated Clusters Based on Hough Transform. SSDBM, 31:1-31:6, ACM, 10.1145/3085504.3085536, BibTeX
  43. Dominik Mautz, Wei Ye, Claudia Plant, and Christian Böhm (2017). Towards an Optimal Subspace for K-Means. KDD, 365-373, ACM, 10.1145/3097983.3097989, BibTeX
  44. Suhang Wang, Charu Aggarwal, and Huan Liu (2017). Randomized Feature Engineering as a Fast and Accurate Alternative to Kernel Methods. KDD, 485-494, ACM, 10.1145/3097983.3098001, BibTeX
  45. Wubai Zhou, Wei Xue, Ramesh Baral, Qing Wang, Chunqiu Zeng, Tao Li, Jian Xu, Zheng Liu, Larisa Shwartz, and Genady Ya. Grabarnik (2017). STAR: A System for Ticket Analysis and Resolution. KDD, 2181-2190, ACM, 10.1145/3097983.3098190, BibTeX
  46. Zhihua Li, Ziyuan Li, Ning Yu, and Steven Wen (2017). Locality-Based Visual Outlier Detection Algorithm for Time Series. Security and Communication Networks 2017, 1-10, Hindawi Limited, 10.1155/2017/1869787
  47. Changbo Ke, Zhiqiu Huang, Fu Xiao, and Linyuan Liu (2017). Privacy Data Decomposition and Discretization Method for SaaS Services. Mathematical Problems in Engineering 2017, 1-11, Hindawi Limited, 10.1155/2017/4785142
  48. 迟荣华, 程媛, 朱素霞, 黄少滨, and 陈德运 (2017). 基于快速高斯变换的不确定数据聚类算法. 通信学报 38(3), 101-111, 10.11959/j.issn.1000-436x.2017061
  49. Sen Wu, Xiaonan Gao, and Lu Liu (2017). ADJ-CABOSFV for High Dimensional Sparse Data Clustering. DEStech Transactions on Economics and Management, DEStech Publications, 10.12783/dtem/apme2016/8736
  50. Guillaume Casanova, Elias Englmeier, Michael E. Houle, Peer Kröger, Michael Nett, Erich Schubert, and Arthur Zimek (2017). Dimensional Testing for Reverse k-Nearest Neighbor Search. PVLDB 10(7), 769-780, 10.14778/3067421.3067426, BibTeX
  51. Burak Omer Saracoglu (2017). Location selection factors of small hydropower plant investments powered by SAW, grey WPM and fuzzy DEMATEL based on human natural language perception. International Journal of Renewable Energy Technology 8(1), 1, Inderscience Publishers, 10.1504/IJRET.2017.080867
  52. Onur Doğan (2017). Ücretsiz Veri Madenciliği Araçlari Ve Türkiye’De Bilinirlikleri Üzerine Bir Araştirma. Ege Stratejik Araştırmalar Dergisi 8(1), 77-93, 10.18354/esam.15352
  53. Jürgen Bernard, Eduard Dobermann, Michael Sedlmair, and Dieter W. Fellner (2017). Combining Cluster and Outlier Analysis with Visual Analytics. EuroVis Workshop on Visual Analytics (EuroVA), The Eurographics Association, 10.2312/eurova.20171114
  54. Yasser Abd Djawad, Andi Mu’nisa, Pangayoman Rusung, Abdi Kurniawan, Irma Suryani Idris, and Mushawwir Taiyeb (2017). Essential Feature Extraction of Photoplethysmography Signal of Men and Women in Their 20s. Engineering Journal 21(4), 259-272, Faculty of Engineering, Chulalongkorn University, 10.4186/ej.2017.21.4.259
  55. Linnea Passing, Manuel Then, Nina Hubig, Harald Lang, Michael Schreier, Stephan Günnemann, Alfons Kemper, and Thomas Neumann (2017). SQL- and Operator-centric Data Analytics in Relational Main-Memory Databases. EDBT, 84-95, 10.5441/002/edbt.2017.09, BibTeX
  56. S Sathappan, S Sridhar, and D Tomar (2017). A Literature Study on Traditional Clustering Algorithms for Uncertain Data. British Journal of Mathematics & Computer Science 21(5), 1-21, Sciencedomain International, 10.9734/BJMCS/2017/32697
  57. Luisa Sanz-Martínez, Juan Alberto Muñoz-Cristóbal, Miguel L. Bote-Lorenzo, Alejandra Martínez-Monés, and Yannis A. Dimitriadis (2017). Toward Criteria-Based Automatic Group Formation in MOOCs. EMOOCs-WIP, 83-88, BibTeX
  58. Chen Luo, and Anshumali Shrivastava (2017). Arrays of (locality-sensitive) Count Estimators (ACE): High-Speed Anomaly Detection via Cache Lookups. CoRR abs/1706.06664, BibTeX
  59. Jonathan R. Wells, and Kai Ming Ting (2017). A simple efficient density estimator that enables fast systematic search. CoRR abs/1707.00783, BibTeX
  60. Markus Mauder (2017). Analyzing complex data using domain constraints. Ludwig Maximilian University of Munich, Germany, BibTeX
  61. Aakash Ravi (2017). Machine learning-based identification of separating features in molecular fragments.
  62. Benjamin Heinzerling, Michael Strube, and Chin-Yew Lin (2017). Trust, but Verify! Better Entity Linking through Automatic Verification. Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics: Volume 1, Long Papers, 828–838, Association for Computational Linguistics
  63. Erich Schubert, Andreas Spitz, Michael Weiler, Johanna Geiß, and Michael Gertz (2017). Semantic Word Clouds with Background Corpus Normalization and t-distributed Stochastic Neighbor Embedding. arXiv 1708.03569
  64. Loïc Prieur-Drevon (2017). Structures de données hautement extensibles pour le stockage sur disque de séries temporelles hétérogènes. École Polytechnique de Montréal
  65. Lu Chen, Yunjun Gao, Baihua Zheng, Christian S. Jensen, Hanyu Yang, and Keyu Yang (2017). Pivot-based metric indexing. Proceedings of the VLDB Endowment 10(10), 1058-1069, VLDB Endowment
  66. Luisa Sanz Martínez, Juan Alberto Muñoz Cristóbal, Miguel L. Bote Lorenzo, Alejandra Martínez Monés, and Yannis A. Dimitriadis (2017). Automatic group formation in a MOOC based on students’ activity criteria (EC-TEL 2017). Springer
  67. Pallam Anusha, and G. Krishna Reddy (2017). Repeal Adjacent Neighbors In Untrue Interval Base Discovery System. IJITR 5(1), 5552-5554
  68. Thomas Ortner, Peter Filzmoser, Maia Zaharieva, Sarka Brodinova, and Christian Breiteneder (2017). Local projections for high-dimensional outlier detection. arXiv 1708.01550


  1. Joy Mustafi (2016). Natural Language Processing and Machine Learning for Big Data. Techniques and Environments for Big Data Analysis, 53-74, Springer, 10.1007/978-3-319-27520-8_4
  2. Johannes Blömer, and Kathrin Bujna (2016). Adaptive Seeding for Gaussian Mixture Models. PAKDD (2), 296-308, Springer, 10.1007/978-3-319-31750-2_24, BibTeX
  3. Amin Aghaee, Mehrdad Ghadiri, and Mahdieh Soleymani Baghshah (2016). Active Distance-Based Clustering Using K-Medoids. PAKDD (1), 253-264, Springer, 10.1007/978-3-319-31753-3_21, BibTeX
  4. Smita Chormunge, and Sudarson Jena (2016). Performance Efficiency and Effectiveness of Clustering Methods for Microarray Datasets. Proceedings of 3rd International Conference on Advanced Computing, Networking and Informatics, 557-567, Springer India, 10.1007/978-81-322-2529-4_58
  5. B. B. Gupta, Aakanksha Tewari, Ankit Kumar Jain, and Dharma P. Agrawal (2016). Fighting against phishing attacks: state of the art and future challenges. Neural Computing and Applications, Springer, 10.1007/s00521-016-2275-y
  6. Guilherme O. Campos, Arthur Zimek, Jörg Sander, Ricardo J. G. B. Campello, Barbora Micenková, Erich Schubert, Ira Assent, and Michael E. Houle (2016). On the evaluation of unsupervised outlier detection: measures, datasets, and an empirical study. Data Min. Knowl. Discov. 30(4), 891-927, 10.1007/s10618-015-0444-8, BibTeX
  7. Ahmed Aleroud, and Aryya Gangopadhyay (2016). Multimode co-clustering for analyzing terrorist networks. Information Systems Frontiers, Springer, 10.1007/s10796-016-9712-4
  8. Bo Jiang, Fei-yue Qiu, and Li-ping Wang (2016). Multi-view clustering via simultaneous weighting on views and features. Appl. Soft Comput. 47, 304-315, 10.1016/j.asoc.2016.06.010, BibTeX
  9. Manal T. Adham, and Peter J. Bentley (2016). Evaluating clustering methods within the Artificial Ecosystem Algorithm and their application to bike redistribution in London. Biosystems 146, 43-59, 10.1016/j.biosystems.2016.04.008, BibTeX
  10. Felix Stahlberg, Tim Schlippe, Stephan Vogel, and Tanja Schultz (2016). Word segmentation and pronunciation extraction from phoneme sequences through cross-lingual word-to-phoneme alignment. Computer Speech & Language 35, 234-261, 10.1016/j.csl.2014.10.001, BibTeX
  11. Bo Jiang, Fei-yue Qiu, Li-ping Wang, and Zhenjun Zhang (2016). Bi-level weighted multi-view clustering via hybrid particle swarm optimization. Inf. Process. Manage. 52(3), 387-398, 10.1016/j.ipm.2015.11.003, BibTeX
  12. Piotr Przybyla, Matthew Shardlow, Sophie Aubin, Robert Bossy, Richard Eckart de Castilho, Stelios Piperidis, John McNaught, and Sophia Ananiadou (2016). Text mining resources for the life sciences. Database 2016, 10.1093/database/baw145, BibTeX
  13. Shane Gero, Hal Whitehead, and Luke Rendell (2016). Individual, unit and vocal clan level identity cues in sperm whale codas. Royal Society Open Science 3(1), 150372, The Royal Society, 10.1098/rsos.150372
  14. Shane Gero, Anne Bøttcher, Hal Whitehead, and Peter Teglberg Madsen (2016). Socially segregated, sympatric sperm whale clans in the Atlantic Ocean. Royal Society Open Science 3(6), 160061, The Royal Society, 10.1098/rsos.160061
  15. Bo Jiang, Fei-yue Qiu, Shipin Yang, and Li-ping Wang (2016). Evolutionary multi-objective optimization for multi-view clustering. CEC, 3308-3315, IEEE, 10.1109/CEC.2016.7744208, BibTeX
  16. Khaled M. Fouad, and Mohamed Farouk Dawood (2016). Adaptive optimized clustering for Veterans’ Administration Lung Cancer. 2016 8th Cairo International Biomedical Engineering Conference (CIBEC), 90-93, IEEE, 10.1109/CIBEC.2016.7836127
  17. Wei Ye, Samuel Maurus, Nina Hubig, and Claudia Plant (2016). Generalized Independent Subspace Clustering. ICDM, 569-578, IEEE, 10.1109/ICDM.2016.0068, BibTeX
  18. Dominik Mautz, Christian Böhm, and Claudia Plant (2016). Subspace Clustering Ensembles through Tensor Decomposition. ICDM Workshops, 1225-1234, IEEE, 10.1109/ICDMW.2016.0177, BibTeX
  19. Yuan Cheng, Ronghua Chi, and Suxia Zhu (2016). An uncertain data model construction method based on nonparametric estimation. 2016 IEEE International Conference on Electronic Information and Communication Technology (ICEICT), 384-389, IEEE, 10.1109/ICEICT.2016.7879722
  20. Martin Jenckel, Syed Saqib Bukhari, and Andreas Dengel (2016). anyOCR: A sequence learning based OCR system for unlabeled historical documents. ICPR, 4035-4040, IEEE, 10.1109/ICPR.2016.7900265, BibTeX
  21. Xu Han, Chee Keong Kwoh, and Jung-jae Kim (2016). Clustering based active learning for biomedical Named Entity Recognition. IJCNN, 1253-1260, IEEE, 10.1109/IJCNN.2016.7727341, BibTeX
  22. Vinh Truong Hoang, Alice Porebski, Nicolas Vandenbroucke, and Denis Hamad (2016). LBP parameter tuning for texture analysis of lace images. 2016 International Image Processing, Applications and Systems (IPAS), 1-6, IEEE, 10.1109/IPAS.2016.7880063
  23. Josua Krause, Aritra Dasgupta, Jean-Daniel Fekete, and Enrico Bertini (2016). SeekAView: An intelligent dimensionality reduction strategy for navigating high-dimensional data spaces. LDAV, 11-19, IEEE, 10.1109/LDAV.2016.7874305, BibTeX
  24. Venkatesh Kulkarni, and Manju Nanda (2016). Data driven prognosis approach for safety critical systems. 2016 IEEE International Conference on Recent Trends in Electronics, Information & Communication Technology (RTEICT), 1699-1703, IEEE, 10.1109/RTEICT.2016.7808123
  25. MingJie Tang, Ruby Y. Tahboub, Walid G. Aref, Mikhail J. Atallah, Qutaibah M. Malluhi, Mourad Ouzzani, and Yasin N. Silva (2016). Similarity Group-by Operators for Multi-Dimensional Relational Data. IEEE Trans. Knowl. Data Eng. 28(2), 510-523, 10.1109/TKDE.2015.2480400, BibTeX
  26. Lei Xu, Chunxiao Jiang, Yong Ren, and Hsiao-Hwa Chen (2016). Microblog Dimensionality Reduction - A Deep Learning Approach. IEEE Trans. Knowl. Data Eng. 28(7), 1779-1789, 10.1109/TKDE.2016.2540639, BibTeX
  27. David Ciechanowicz, Dominik Pelzer, Benedikt Bartenschlager, and Alois Knoll (2016). A Modular Power System Planning and Power Flow Simulation Framework for Generating and Evaluating Power Network Models. IEEE Transactions on Power Systems 32(3), 2214-2224, IEEE, 10.1109/TPWRS.2016.2602479
  28. Xiaodan Hou, and Tao Zhang (2016). Unsupervised universal steganalyzer for high-dimensional steganalytic features. J. Electronic Imaging 25(6), 63016, 10.1117/1.JEI.25.6.063016, BibTeX
  29. Fabrizio Angiulli, and Fabio Fassetti (2016). Toward Generalizing the Unification with Statistical Outliers: The Gradient Outlier Factor Measure. TKDD 10(3), 27:1-27:26, 10.1145/2829956, BibTeX
  30. Erich Schubert, Michael Weiler, and Hans-Peter Kriegel (2016). SPOTHOT: Scalable Detection of Geo-spatial Events in Large Textual Streams. SSDBM, 8:1-8:12, ACM, 10.1145/2949689.2949699, BibTeX
  31. Hossein Hamooni, Biplob Debnath, Jianwu Xu, Hui Zhang, Guofei Jiang, and Abdullah Mueen (2016). LogMine: Fast Pattern Recognition for Log Analytics. CIKM, 1573-1582, ACM, 10.1145/2983323.2983358, BibTeX
  32. Apurva Narechania, Richard Baker, Rob DeSalle, Barun Mathema, Sergios-Orestis Kolokotronis, Barry Kreiswirth, and Paul J. Planet (2016). Clusterflock: a flocking algorithm for isolating congruent phylogenomic datasets. GigaScience 5(1), Oxford University Press (OUP), 10.1186/s13742-016-0152-3
  33. Piotr Andrzej Kowalski, Szymon Lukasik, Malgorzata Charytanowicz, and Piotr Kulczycki (2016). Clustering based on the Krill Herd Algorithm with Selected Validity Measures. FedCSIS, 79-87, 10.15439/2016F295, BibTeX
  34. Amit Verma, Iqbaldeep Kaur, and Amandeep Kaur (2016). Algorithmic Approach to Data Mining and Classification Techniques. Indian Journal of Science and Technology 9(28), Indian Society for Education and Environment, 10.17485/ijst/2016/v9i28/88874
  35. Ivano Verzola, Alessandro Donati, Jose Martinez, Matthias Schubert, and Laszlo Somodi (2016). Project Sibyl: A Novelty Detection System for Human Spaceflight Operations. 14th International Conference on Space Operations, American Institute of Aeronautics and Astronautics (AIAA), 10.2514/6.2016-2405
  36. Qingying Yu, Yonglong Luo, Chuanming Chen, and Weixin Bian (2016). Neighborhood relevant outlier detection approach based on information entropy. Intell. Data Anal. 20(6), 1247-1265, 10.3233/IDA-150301, BibTeX
  37. Huanyang Zheng, and Jie Wu (2016). Which, When, and How: Hierarchical Clustering with Human-Machine Cooperation. Algorithms 9(4), 88, 10.3390/a9040088, BibTeX
  38. Alejandro Rituerto, Henrik Andreasson, Ana C. Murillo, Achim J. Lilienthal, and José Jesús Guerrero (2016). Building an Enhanced Vocabulary of the Robot Environment with a Ceiling Pointing Camera. Sensors 16(4), 493, 10.3390/s16040493, BibTeX
  39. Merima Kulin, Carolina Fortuna, Eli De Poorter, Dirk Deschrijver, and Ingrid Moerman (2016). Data-Driven Design of Intelligent Wireless Networks: An Overview and Tutorial. Sensors 16(6), 790, 10.3390/s16060790, BibTeX
  40. V. Mahalakshmi, and M. Govindarajan (2016). Comparison of Outlier Detection Methods in Diabetes Data. International Journal of Computer Applications 155(10), 28-32, Foundation of Computer Science, 10.5120/ijca2016912451
  41. Gang Chen, Haiying Zhang, and Caiming Xiong (2016). Maximum Margin Dirichlet Process Mixtures for Clustering. AAAI, 1491-1497, AAAI Press, BibTeX
  42. Jeffrey Hudack, and Jae C. Oh (2016). Multi-Agent Sensor Data Collection with Attrition Risk. ICAPS, 166-174, AAAI Press, BibTeX
  43. Fatemeh Riahi, and Oliver Schulte (2016). Propositionalization for Unsupervised Outlier Detection in Multi-Relational Data. FLAIRS Conference, 448-453, AAAI Press, BibTeX
  44. Zhiruo Zhao, Chilukuri K. Mohan, and Kishan G. Mehrotra (2016). Adaptive Sampling and Learning for Unsupervised Outlier Detection. FLAIRS Conference, 460-466, AAAI Press, BibTeX
  45. Sebastian Bothe, and Tamás Horváth (2016). The Partial Weighted Set Cover Problem with Applications to Outlier Detection and Clustering. LWDA, 335-346, BibTeX
  46. Peter Bailis, Deepak Narayanan, and Samuel Madden (2016). MacroBase: Analytic Monitoring for the Internet of Things. CoRR abs/1603.00567, BibTeX
  47. Weiyu Huang, and Alejandro Ribeiro (2016). Hierarchical Clustering Given Confidence Intervals of Metric Distances. CoRR abs/1610.04274, BibTeX
  48. Yannis Papanikolaou, Ioannis Katakis, and Grigorios Tsoumakas (2016). Hierarchical Partitioning of the Output Space in Multi-label Data. CoRR abs/1612.06083, BibTeX
  49. Johannes Schneider, and Thomas Locher (2016). Obfuscation using Encryption. CoRR abs/1612.03345, BibTeX
  50. Klaus Arthur Schmid (2016). Searching and mining in enriched geo-spatial data. Ludwig Maximilian University of Munich, Germany, BibTeX
  51. Michael Weiler (2016). Event detection in high throughput social media. Ludwig Maximilian University of Munich, Germany, BibTeX
  52. Simon Maag, and Hanspeter Kriesi (2016). Politicisation, conflicts and the structuring of the EU political space. Politicising Europe, Cambridge University Press, 9781107129412
  53. Ahmed Balfagih (2016). Direct Selling Business Lead Prediction by Social Media Data Mining.
  54. Anthony McCaffrey, and University Of Massachusetts (2016). Feature type spectrum technique.
  55. B. Gajewski, and T. Martyn (2016). Spatial data clustering in independent mobile environment. Measurement Automation Monitoring 62(5)
  56. Bruno Miguel Nunes da Silva (2016). Exploratory Cluster Analysis from Ubiquitous Data Streams using Self-Organizing Maps.
  57. Francisco Daniel Porras Bernárdez (2016). Extraction of User’s Stays and Transitions from GPS Logs: A Comparison of Three Spatio-Temporal Clustering Approaches.
  58. G. O. Campos, A. Zimek, J. Sander, R. J. G. B. Campello, B. Micenková, E. Schubert, I. Assent, and M. E. Houle (2016). On the Evaluation of Outlier Detection: Measures, Datasets, and an Empirical Study Continued. Proceedings of the LWDA 2016 Workshops: KDML, FGWM, FGIR, and FGDB, Potsdam, Germany
  59. Guansong Pang, Kai Ming Ting, David Albrecht, and Huidong Jin (2016). ZERO++: Harnessing the Power of Zero Appearances to Detect Anomalies in Large-Scale Data Sets. Journal of Artificial Intelligence Research 57, 593-620
  60. Hrvoje Brlečić Layer (2016). Klasifikacija energetskih subjekata u Republici Hrvatskoj korištenjem otkrivanja znanja iz baza podataka. University of Zagreb. Faculty of Economics and Business.
  61. Jakub Velkoborský (2016). Hierarchical visualization of the chemical space.
  62. Jeffrey Hudack (2016). Risk-Aware Planning for Sensor Data Collection. Syracuse University
  63. Joonas Puura (2016). Tarkvara loomine erinevate k-keskmiste algoritmide rakendamiseks (Software for Clustering Using k-means Algorithms).
  64. Justin Sam Chew, and Maurice HT Ling (2016). TAPPS Release 1: Plugin-Extensible Platform for Technical Analysis and Applied Statistics. Advances in Computer Science: an International Journal 5(1), 132-141
  65. Luca Putelli (2016). Estrazione di regole di associazione da dati RDF. Italy
  66. Martin Jenckel, Syed Saqib Bukhari, and Andreas Dengel (2016). Clustering Benchmark for Characters in Historical Documents. DAS 2016 Short Paper Booklet, 33-34
  67. Michael Siers, and Md Islam (2016). RBClust: High quality class-specific clustering using rule-based classification. ESANN
  68. Miguel José Cavadas Santos (2016). Automated Scalable Platform for Packet Traffic Analysis.
  69. Mingjie Tang (2016). Efficient processing of similarity queries with applications. Purdue University
  70. N. Srujana, G. Srinivasa Rao, and Dr. M. V. Sivaprasad (2016). Unsupervised Distance-Based Outlier Detection In High Dimensional Data. IJITR 4(5), 3905–3907
  71. P.A.R. Kostjens (2016). Anomaly Detection in Application Log Data.
  72. Parvej Aalam, and Tamanna Siddiqui (2016). Comparative study of data mining tools used for clustering. 2016 3rd International Conference on Computing for Sustainable Global Development (INDIACom), 3971-3975, IEEE
  73. Pawel Lee (2016). Structure in Star Forming Regions. University of Sheffield
  74. Ravi Chinapaga, D. Sravya, M Bal Raju, and N Subhash Chandra (2016). Detecting Outliers Using Euclidean Distance In Unsupervised Method. IJITR 4(5), 3855–3857
  75. Stephen K Karanja (2016). Density-based Cluster Analysis Of Fire Hot Spots In Kenya’s Wildlife Protected Areas. University of Nairobi
  76. Talita de Souza Rampão (2016). Mineração de dados em bases jurídicas: um estudo de caso.
  77. Thomas Rusch, Kurt Hornik, and Patrick Mair (2016). Assessing and quantifying clusteredness: The OPTICS Cordillera. WU Vienna University of Economics and Business
  78. Tilmann Zäschke (2016). The PH-Tree Revisited r1.2.
  79. Trusina Jan (2016). Implementace evolučního shlukování. České vysoké učení technické v Praze. Vypočetní a informační centrum.


  1. Greg Hamerly, and Jonathan Drake (2015). Accelerating Lloyd’s Algorithm for k-Means Clustering. Partitional Clustering Algorithms, 41-78, Springer, 10.1007/978-3-319-09259-1_2
  2. Monika Kofler, Andreas Beham, Stefan Wagner, and Michael Affenzeller (2015). Robust Storage Assignment in Warehouses with Correlated Demand. Computational Intelligence and Efficiency in Engineering Systems, 415-428, Springer, 10.1007/978-3-319-15720-7_29, BibTeX
  3. Erich Schubert, Arthur Zimek, and Hans-Peter Kriegel (2015). Fast and Scalable Outlier Detection with Approximate Nearest Neighbor Ensembles. DASFAA (2), 19-36, Springer, 10.1007/978-3-319-18123-3_2, BibTeX
  4. Taylor Arnold, and Lauren Tilton (2015). Image Data. Humanities Data in R, 113-129, Springer, 10.1007/978-3-319-20702-5_8
  5. Markus Mauder, Markus Reisinger, Tobias Emrich, Andreas Züfle, Matthias Renz, Goce Trajcevski, and Roberto Tamassia (2015). Minimal Spatio-Temporal Database Repairs. SSTD, 255-273, Springer, 10.1007/978-3-319-22363-6_14, BibTeX
  6. Lasanthi Heendaliya, Michael Wisely, Dan Lin, Sahra Sedigh Sarvestani, and Ali R. Hurson (2015). Influence-Aware Predictive Density Queries Under Road-Network Constraints. SSTD, 80-97, Springer, 10.1007/978-3-319-22363-6_5, BibTeX
  7. Tobias Emrich, Klaus Arthur Schmid, Andreas Züfle, Matthias Renz, and Reynold Cheng (2015). Uncertain Voronoi Cell Computation Based on Space Decomposition. SSTD, 98-116, Springer, 10.1007/978-3-319-22363-6_6, BibTeX
  8. Bo Zhu, Alexandru Mara, and Alberto Mozo (2015). CLUS: Parallel Subspace Clustering Algorithm on Spark. ADBIS (Short Papers and Workshops), 175-185, Springer, 10.1007/978-3-319-23201-0_20, BibTeX
  9. Pengjie Ren, Peng Liu, Zhumin Chen, Jun Ma, and Xiaomeng Song (2015). Learning Similarity Functions for Urban Events Detection by Mining Hotline Phone Records. APWeb, 411-423, Springer, 10.1007/978-3-319-25255-1_34, BibTeX
  10. Nadezhda Fedorova, Josep Blat, and David F. Nettleton (2015). Can Embedding Solve Scalability Issues for Mixed-Data Graph Clustering? Euro-Par Workshops, 481-492, Springer, 10.1007/978-3-319-27308-2_39, BibTeX
  11. Tobias Emrich, Hans-Peter Kriegel, Peer Kröger, Johannes Niedermayer, Matthias Renz, and Andreas Züfle (2015). On reverse-k-nearest-neighbor joins. GeoInformatica 19(2), 299-330, 10.1007/s10707-014-0215-5, BibTeX
  12. Arthur Zimek, and Jilles Vreeken (2015). The blind men and the elephant: on meeting the problem of multiple truths in data from clustering and pattern mining perspectives. Machine Learning 98(1-2), 121-155, 10.1007/s10994-013-5334-y, BibTeX
  13. Heiko Paulheim, and Robert Meusel (2015). A decomposition of the outlier detection problem into a set of supervised learning problems. Machine Learning 100(2-3), 509-531, 10.1007/s10994-015-5507-y, BibTeX
  14. Daniel Avila, and Iren Valova (2015). RADDACL2: a recursive approach to discovering density clusters. Progress in AI 4(1-2), 21-36, 10.1007/s13748-015-0066-9, BibTeX
  15. Tamer F. Ghanem, Wail S. El-Kilani, Hatem M. Abdelkader, and Mohiy M. Hadhoud (2015). Fast Dimension-based Partitioning and Merging clustering algorithm. Appl. Soft Comput. 36, 143-151, 10.1016/j.asoc.2015.05.049, BibTeX
  16. Antonio Lavecchia (2015). Machine-learning approaches in drug discovery: methods and applications. Drug Discovery Today 20(3), 318-331, Elsevier BV, 10.1016/j.drudis.2014.10.012
  17. Francisco Maciá Pérez, José Vicente Berná-Martínez, Alberto Fernández Oliva, and Miguel Alfonso Abreu Ortega (2015). Algorithm for the detection of outliers based on the theory of rough sets. Decision Support Systems 75, 63-75, 10.1016/j.dss.2015.05.002, BibTeX
  18. Mohamed Bouguessa (2015). A practical outlier detection approach for mixed-attribute data. Expert Syst. Appl. 42(22), 8637-8649, 10.1016/j.eswa.2015.07.018, BibTeX
  19. Wen-qian Liu, Jun Liu, Meng Wang, Qinghua Zheng, Wei Zhang, Lingyun Song, and Siyu Yao (2015). Faceted fusion of RDF data. Information Fusion 23, 16-24, 10.1016/j.inffus.2014.06.005, BibTeX
  20. Seok-Ho Yoon, Ki-Nam Kim, Jiwon Hong, Sang-Wook Kim, and Sunju Park (2015). A community-based sampling method using DPL for online social networks. Inf. Sci. 306, 53-69, 10.1016/j.ins.2015.02.014, BibTeX
  21. Bifan Wei, Jun Liu, Qinghua Zheng, Wei Zhang, Chenchen Wang, and Bei Wu (2015). DF-Miner: Domain-specific facet mining by leveraging the hyperlink structure of Wikipedia. Knowl.-Based Syst. 77, 80-91, 10.1016/j.knosys.2015.01.001, BibTeX
  22. M. Peyro, M. Soheilypour, B.L. Lee, and M.R.K. Mofrad (2015). Evolutionarily Conserved Sequence Features Regulate the Formation of the FG Network at the Center of the Nuclear Pore Complex. Scientific Reports 5(1), Springer, 10.1038/srep15795
  23. Yang Zhao, Abhishek K. Shrivastava, and Kwok Leung Tsui (2015). Imbalanced Classification by Learning Hidden Data Structure. IIE Transactions, Informa UK Limited, 10.1080/0740817X.2015.1110269
  24. Anand Mehta, and Onkar Dikshit (2015). Comparative study on projected clustering methods for hyperspectral imagery classification. Geocarto International, 1-12, Informa UK Limited, 10.1080/10106049.2015.1047416
  25. Panagiotis Barlas, Ivor Lanning, and Cathal Heavey (2015). A survey of open source data science tools. Int. J. Intelligent Computing and Cybernetics 8(3), 232-261, 10.1108/IJICC-07-2014-0031, BibTeX
  26. Lei Xu, Chunxiao Jiang, and Yong Ren (2015). Deep learning in exploring semantic relatedness for microblog dimensionality reduction. GlobalSIP, 98-102, IEEE, 10.1109/GlobalSIP.2015.7418164, BibTeX
  27. Michael Wisely, Ali R. Hurson, and Sahra Sedigh Sarvestani (2015). An extensible simulation framework for evaluating centralized traffic prediction algorithms. ICCVE, 391-396, IEEE, 10.1109/ICCVE.2015.86, BibTeX
  28. Juan M. Banda, and Rafal A. Angryk (2015). Unsupervised Learning Techniques for Detection of Regions of Interest in Solar Images. ICDM Workshops, 582-588, IEEE, 10.1109/ICDMW.2015.61, BibTeX
  29. Guansong Pang, Kai Ming Ting, and David Albrecht (2015). LeSiNN: Detecting Anomalies by Identifying Least Similar Nearest Neighbours. ICDM Workshops, 623-630, IEEE, 10.1109/ICDMW.2015.62, BibTeX
  30. Erich Schubert, Michael Weiler, and Arthur Zimek (2015). Outlier Detection and Trend Detection: Two Sides of the Same Coin. ICDM Workshops, 40-46, IEEE, 10.1109/ICDMW.2015.79, BibTeX
  31. Fatemeh Riahi, and Oliver Schulte (2015). Model-Based Outlier Detection for Object-Relational Data. SSCI, 1590-1598, IEEE, 10.1109/SSCI.2015.224, BibTeX
  32. Milos Radovanovic, Alexandros Nanopoulos, and Mirjana Ivanovic (2015). Reverse Nearest Neighbors in Unsupervised Distance-Based Outlier Detection. IEEE Trans. Knowl. Data Eng. 27(5), 1369-1382, 10.1109/TKDE.2014.2365790, BibTeX
  33. Alvin Chiang, and Yi-Ren Yeh (2015). Anomaly Detection Ensembles: In Defense of the Average. WI-IAT (3), 207-210, IEEE, 10.1109/WI-IAT.2015.260, BibTeX
  34. Hezheng Yin, Joseph Bahman Moghadam, and Armando Fox (2015). Clustering Student Programming Assignments to Multiply Instructor Leverage. L@S, 367-372, ACM, 10.1145/2724660.2728695, BibTeX
  35. Neil Scicluna, and Christos-Savvas Bouganis (2015). ARC 2014: A Multidimensional FPGA-Based Parallel DBSCAN Architecture. TRETS 9(1), 2:1-2:15, 10.1145/2724722, BibTeX
  36. Ricardo J. G. B. Campello, Davoud Moulavi, Arthur Zimek, and Jörg Sander (2015). Hierarchical Density Estimates for Data Clustering, Visualization, and Outlier Detection. TKDD 10(1), 5:1-5:51, 10.1145/2733381, BibTeX
  37. Ling Chen, Ting Yu, and Rada Chirkova (2015). WaveCluster with Differential Privacy. CIKM, 1011-1020, ACM, 10.1145/2806416.2806546, BibTeX
  38. Yikai Gong, Fengmin Deng, and Richard O. Sinnott (2015). Identification of (near) Real-time Traffic Congestion in the Cities of Australia through Twitter. UCUI@CIKM, 7-12, ACM, 10.1145/2811271.2811276, BibTeX
  39. Charu C. Aggarwal, and Saket Sathe (2015). Theoretical Foundations and Algorithms for Outlier Ensembles. SIGKDD Explorations 17(1), 24-47, 10.1145/2830544.2830549, BibTeX
  40. Hugo Zeberg, Hugh P. C. Robinson, and Peter Århem (2015). Density of voltage-gated potassium channels is a bifurcation parameter in pyramidal neurons. Journal of Neurophysiology 113(2), 537-549, American Physiological Society, 10.1152/jn.00907.2013
  41. Benjamin Ducke (2015). Spatial Cluster Detection in Archaeology: Current Theory and Practice. Mathematics and Archaeology, 352-368, CRC Press, 10.1201/b18530-22
  42. Erich Schubert, Alexander Koos, Tobias Emrich, Andreas Züfle, Klaus Arthur Schmid, and Arthur Zimek (2015). A Framework for Clustering Uncertain Data. PVLDB 8(12), 1976-1979, 10.14778/2824032.2824115, BibTeX
  43. Fabien André, Anne-Marie Kermarrec, and Nicolas Le Scouarnec (2015). Cache locality is not enough: High-Performance Nearest Neighbor Search with Product Quantization Fast Scan. PVLDB 9(4), 288-299, 10.14778/2856318.2856324, BibTeX
  44. Patrick Oesterling, Patrick Jähnichen, Gerhard Heyer, and Gerik Scheuermann (2015). Topological visual analysis of clusterings in high-dimensional information spaces. it - Information Technology 57(1), 3-10, 10.1515/itit-2014-1073, BibTeX
  45. S. Gayathri, M. Mary Metilda, and S. Sanjai Babu (2015). A Shared Nearest Neighbour Density based Clustering Approach on a Proclus Method to Cluster High Dimensional Data. Indian Journal of Science and Technology 8(22), Indian Society for Education and Environment, 10.17485/ijst/2015/v8i22/79131
  46. I. A. Pestunov, S. A. Rylov, and V. B. Berikov (2015). Hierarchical clustering algorithms for segmentation of multispectral images. Optoelectronics, Instrumentation and Data Processing 51(4), 329-338, Allerton Press, 10.3103/S8756699015040020
  47. Lindsay Lloyd-Smith, John Krigbaum, and Benjamin Valentine (2015). Social affiliation, settlement pattern histories and subsistence change in Neolithic Borneo. Routledge Handbooks Online, 10.4324/9781315725444.ch13
  48. Mansi Gera, and Shivani Goel (2015). Data Mining - Techniques, Methods and Algorithms: A Review on Tools and their Validity. IJCA 113(18), 22-29, Foundation of Computer Science, 10.5120/19926-2042
  49. Smita Chormunge, and Sudarson Jena (2015). Efficiency and Effectiveness of Clustering Algorithms for High Dimensional Data. IJCA 125(11), 35-40, Foundation of Computer Science, 10.5120/ijca2015906144
  50. Alejandro Rituerto (2015). Modeling the environment with egocentric vision systems. ELCVIA Electronic Letters on Computer Vision and Image Analysis 14(3), Universitat Autonoma de Barcelona, 10.5565/rev/elcvia.739
  51. Veit Köppen, Mario Hildebrandt, and Martin Schäler (2015). On performance optimization potentials regarding data classification in forensics. BTW Workshops, 21-36, GI, BibTeX
  52. Jürgen Hermes, Michael Richter, and Claes Neuefeind (2015). Automatic Induction of German Aspectual Verb Classes in a Distributional Framework. GSCL, 122-129, GSCL e.V., BibTeX
  53. David Alfter (2015). Language Segmentation. CoRR abs/1510.01717, BibTeX
  54. Keqian Li (2015). On Integrating Information Visualization Techniques into Data Mining: A Review. CoRR abs/1503.00202, BibTeX
  55. Johannes Niedermayer (2015). Complex queries and complex data: challenges in similarity search. Ludwig Maximilians University Munich, BibTeX
  56. Matthias Rohr (2015). Workload-sensitive Timing Behavior Analysis for Fault Localization in Software Systems. University of Kiel, BibTeX
  57. Julien Soler (2015). Orion, A Generic Model for Data Mining: Application to Video Games. (Orion, un modèle générique pour la fouille de données: application aux jeux vidéo). University of Western Brittany, Brest, France, BibTeX
  58. Akshay Vishwanath Bhinge (2015). A comparative study on data mining tools.
  59. Barbora Micenková (2015). Outlier Detection and Explanation for Domain Experts. Department of Computer Science, University of Aarhus
  60. Bryan Omar Collazo Santiago (2015). Machine learning blocks. Massachusetts Institute of Technology
  61. Carl Levin, and Christopher Håkansson (2015). Clustering driver’s destinations - using internal evaluation to adaptively set parameters.
  62. Gilad Armon, Adiel Loinger, Uri Blatt, and Shahar Siegman (2015). Benchmarking In Online Advertising.
  63. Gordon O Ondego (2015). A comparative study of decision Tree and Naïve Bayesian Classifiers on Verbal Autopsy Datasets. University of Nairobi
  64. Guansong Pang (2015). Anomaly detection based on zero appearances in subspaces. Monash University. Faculty of Information Technology. Clayton School of Information Technology
  65. Guilherme Oliveira Campos (2015). Estudo, avaliação e comparação de técnicas de detecção não supervisionada de outliers. Biblioteca Digital de Teses e Dissertações da Universidade de São Paulo
  66. Irene Fernández Sánchez (2015). Diseño de una metodología de evaluación de servicios públicos basada en modelos analíticos sobre datos abiertos y de redes sociales. Telecomunicacion
  67. Jonathan von Brünken, Michael E. Houle, and Arthur Zimek (2015). Intrinsic Dimensional Outlier Detection in High-Dimensional Data. NII Technical Report (NII-2015-003E), NII
  68. Judit Kockat, and Clemens Rohde (2015). Conditions for local adaption of building policies in German cities according to their building structure and demography. ECEEE
  69. Katarzyna Racka (2015). Metody eksploracji danych i ich zastosowanie. Zeszyty Naukowe Państwowej Wyższej Szkoły Zawodowej w Płocku. Nauki Ekonomiczne 21 Wybrane problemy gospodarki europejskiej, 143-150
  70. Konstantinos Kontakis, and Κωνσταντίνος Κοντάκης (2015). Σημασιολογική περιγραφή σκηνών σε περιβάλλοντα εικονικής πραγματικότητας. Τ.Ε.Ι. Κρήτης, Σχολή Τεχνολογικών Εφαρμογών (Σ.Τ.Εφ), ΠΜΣ Πληροφορική και Πολυμέσα
  71. Křeček Martin (2015). Rozšíření platformy Clueminer o grafové algoritmy. České vysoké učení technické v Praze. Výpočetní a informační centrum.
  72. Lasanthi Nilmini Heendaliya (2015). Enabling near-term prediction of status for intelligent transportation systems: Management techniques for data on mobile objects. Missouri University of Science and Technology
  73. Lev Aleksandrovich Kazakovtsev, Aljona Aleksandrovna Stupina, Victor Ivanovich Orlov, Margarita Vladimirovna Karaseva, and Igor Sergeevich Masich (2015). Clustering Methods For Classification Of Electronic Devices By Production Batches And Quality Classes. Facta Universitatis, Series: Mathematics and Informatics 30(5), 567-581
  74. Lev Aleksandrovich Kazakovtsev, Victor Orlov, Aljona Aleksandrovna Stupina, and Vladimir Kazakovtsev (2015). Modified Genetic Algorithm with Greedy Heuristic for Continuous and Discrete p-Median Problems. Facta Universitatis, Series: Mathematics and Informatics 30(1), 89-106
  75. Mansi Gera, and Shivani Goel (2015). An Approach for Improving Accuracy of Prediction Using Ensemble Modeling.
  76. Markku Silén (2015). Symbolisen ja numeerisen laskennan ohjelmat opiskelijan apuna. Lapin ammattikorkeakoulu
  77. Preeti Bhargava (2015). Towards Proactive Context-aware Computing and Systems.
  78. Rashedul Amin Tuhin (2015). Securing GNSS Receivers with a Density-based Clustering Algorithm.
  79. Swetha Rajendiran (2015). Learning classification algorithms in data mining.
  80. Tharindu Bandaragoda (2015). Isolation based anomaly detection: a re-examination.
  81. Toon Van Craenendonck, and Hendrik Blockeel (2015). Limitations of using constraint set utility in semi-supervised clustering.
  82. Ulisses Costa, and Jorge Reis (2015). Incremental DBSCAN for Green Computing.
  83. Yan Liao, Jialin Hua, and Wensheng Zhu (2015). An Effective Divide-and-Merge Method for Hierarchical Clustering. American Scientific Publishers
  84. Zoraida Emperatriz Mamani Rodríguez (2015). Aplicación de la minería de datos distribuida usando algoritmo de clustering k-means para mejorar la calidad de servicios de las organizaciones modernas caso: Poder judicial. Universidad Nacional Mayor de San Marcos. Programa Cybertesis PERÚ
  85. Л.А. Казаковцев, А.А. Ступина, and В.И. Орлов (2015). Выбор Метрики Для Системы Автоматической Классификации Электрорадиоизделий По Производственным Партиям. Программные продукты и системы, Закрытое акционерное общество Научно-исследовательский институт “Центрпрограммсистем”
  86. 신동화, 이세희, and 서진욱 (2015). 계층 발생 프레임워크를 이용한 군집 계층 시각화. 정보과학회 컴퓨팅의 실제 논문지 21(6), 436-441


  1. Maria Camila Nardini Barioni, Humberto Luiz Razente, Alessandra M. R. Marcelino, Agma J. M. Traina, and Caetano Traina Jr. (2014). Open issues for partitioning clustering methods: an overview. Wiley Interdisc. Rew.: Data Mining and Knowledge Discovery 4(3), 161-177, 10.1002/widm.1127, BibTeX
  2. Mathilde Sahuguet, and Benoit Huet (2014). Mining the Web for Multimedia-Based Enriching. MMM (2), 263-274, Springer, 10.1007/978-3-319-04117-9_24, BibTeX
  3. Neil Scicluna, and Christos-Savvas Bouganis (2014). FPGA-Based Parallel DBSCAN Architecture. ARC, 1-12, Springer, 10.1007/978-3-319-05960-0_1, BibTeX
  4. Mahsa Salehi, Christopher A. Leckie, Masud Moshtaghi, and Tharshan Vaithianathan (2014). A Relevance Weighted Ensemble Model for Anomaly Detection in Switching Data Streams. PAKDD (2), 461-473, Springer, 10.1007/978-3-319-06605-9_38, BibTeX
  5. Sunil Aryal, Kai Ming Ting, Jonathan R. Wells, and Takashi Washio (2014). Improving iForest with Relative Mass. PAKDD (2), 510-521, Springer, 10.1007/978-3-319-06605-9_42, BibTeX
  6. Giuseppe Rizzo, Giacomo Falcone, Rosa Meo, Ruggero G. Pensa, Raphaël Troncy, and Vuk Milicic (2014). Geographic Summaries from Crowdsourced Data. ESWC (Satellite Events), 477-482, Springer, 10.1007/978-3-319-11955-7_70, BibTeX
  7. Johannes Niedermayer, and Peer Kröger (2014). Retrieval of Binary Features in Image Databases: A Study. SISAP, 151-163, Springer, 10.1007/978-3-319-11988-5_14, BibTeX
  8. Kirill Smirnov, George Chernishev, Pavel Fedotovsky, George Erokhin, and Kirill Cherednik (2014). The Study of Multidimensional R-Tree-Based Index Scalability in Multicore Environment. Ershov Memorial Conference, 266-272, Springer, 10.1007/978-3-662-46823-4_22, BibTeX
  9. Jeremy Steinhauer, Lois M. L. Delcambre, Marianne Lykke, and Marit Kristine Ådland (2014). Evaluating distance-based clustering for user (browse and click) sessions in a domain-specific collection. Int. J. on Digital Libraries 14(3-4), 167-179, 10.1007/s00799-014-0117-z, BibTeX
  10. Erich Schubert, Arthur Zimek, and Hans-Peter Kriegel (2014). Local outlier detection reconsidered: a generalized view on locality with applications to spatial, video, and network outlier detection. Data Min. Knowl. Discov. 28(1), 190-237, 10.1007/s10618-012-0300-z, BibTeX
  11. Michael Davis, Weiru Liu, and Paul C. Miller (2014). Finding the most descriptive substructures in graphs with discrete and numeric labels. J. Intell. Inf. Syst. 42(2), 307-332, 10.1007/s10844-013-0299-7, BibTeX
  12. Chen Lin, Runquan Xie, Xinjun Guan, Lei Li, and Tao Li (2014). Personalized news recommendation via implicit social experts. Inf. Sci. 254, 1-18, 10.1016/j.ins.2013.08.034, BibTeX
  13. Jonathan R. Wells, Kai Ming Ting, and Takashi Washio (2014). LiNearN: A new approach to nearest neighbour density estimator. Pattern Recognition 47(8), 2702-2720, 10.1016/j.patcog.2014.01.013, BibTeX
  14. Allison Reilly, and Seth Guikema (2014). Bayesian Multiscale Modeling of Spatial Infrastructure Performance Predictions with an Application to Electric Power Outage Forecasting. J. Infrastruct. Syst., 04014036, American Society of Civil Engineers (ASCE), 10.1061/(ASCE)IS.1943-555X.0000222
  15. Hua Lou, and Ye Zhu (2014). Bivariate probability-based anomaly detection. BESC, 81-86, IEEE, 10.1109/BESC.2014.7059512, BibTeX
  16. Francesco Alex Indaco, and Teng-Sheng Moh (2014). Hierarchical Density-Based Clustering Using Level-Sets. CloudCom, 692-695, IEEE, 10.1109/CloudCom.2014.126, BibTeX
  17. Xuan-Hong Dang, Ira Assent, Raymond T. Ng, Arthur Zimek, and Erich Schubert (2014). Discriminative features for identifying and interpreting outliers. ICDE, 88-99, IEEE, 10.1109/ICDE.2014.6816642, BibTeX
  18. Tharindu R. Bandaragoda, Kai Ming Ting, David Albrecht, Fei Tony Liu, and Jonathan R. Wells (2014). Efficient Anomaly Detection by Isolation Using Nearest Neighbour Ensemble. ICDM Workshops, 698-705, IEEE, 10.1109/ICDMW.2014.70, BibTeX
  19. Tamer F. Ghanem, Wail S. Elkilani, Hatem S. Ahmed, and Mohiy M. Hadhoud (2014). DPM: Fast and scalable clustering algorithm for large scale high dimensional datasets. 2014 10th International Computer Engineering Conference (ICENCO), 26-35, IEEE, 10.1109/ICENCO.2014.7050427
  20. Johannes Blömer, Kathrin Bujna, and Daniel Kuntze (2014). A Theoretical and Experimental Comparison of the EM and SEM Algorithm. ICPR, 1419-1424, IEEE, 10.1109/ICPR.2014.253, BibTeX
  21. Alan Jovic, Karla Brkic, and Nikola Bogunovic (2014). An overview of free software tools for general data mining. MIPRO, 1112-1117, IEEE, 10.1109/MIPRO.2014.6859735, BibTeX
  22. Veit Köppen, Martin Schäler, and Reimar Schröter (2014). Toward variability management to tailor high dimensional index implementations. RCIS, 1-6, IEEE, 10.1109/RCIS.2014.6861069, BibTeX
  23. Erich Schubert, Arthur Zimek, and Hans-Peter Kriegel (2014). Generalized Outlier Detection with Flexible Kernel Density Estimates. SDM, 542-550, SIAM, 10.1137/1.9781611973440.63, BibTeX
  24. Mohamed Bouguessa (2014). A Mixture Model-Based Combination Approach for Outlier Detection. International Journal on Artificial Intelligence Tools 23(4), 10.1142/S0218213014600215, BibTeX
  25. Arthur Zimek, Ricardo J. G. B. Campello, and Jörg Sander (2014). Data perturbation for outlier detection ensembles. SSDBM, 13:1-13:12, ACM, 10.1145/2618243.2618257, BibTeX
  26. Xiao He, Jing Feng, Bettina Konte, Son T. Mai, and Claudia Plant (2014). Relevant overlapping subspace clusters on categorical data. KDD, 213-222, ACM, 10.1145/2623330.2623652, BibTeX
  27. Andreas Züfle, Tobias Emrich, Klaus Arthur Schmid, Nikos Mamoulis, Arthur Zimek, and Matthias Renz (2014). Representative clustering of uncertain data. KDD, 243-252, ACM, 10.1145/2623330.2623725, BibTeX
  28. Erich Schubert, Michael Weiler, and Hans-Peter Kriegel (2014). SigniTrend: scalable detection of emerging topics in textual streams by hashed significance thresholds. KDD, 871-880, ACM, 10.1145/2623330.2623740, BibTeX
  29. Shaobin Huang, Yuan Cheng, Dapeng Lang, Ronghua Chi, and Guofeng Liu (2014). A Formal Algorithm for Verifying the Validity of Clustering Results Based on Model Checking. PLoS ONE 9(3), e90109, Public Library of Science (PLoS), 10.1371/journal.pone.0090109
  30. Reka K. Kelemen, Gengen F. He, Hannah L. Woo, Thomas Lane, Caroline Rempe, Jun Wang, Ian A. Cockburn, Rogerio Amino, Vitaly V. Ganusov, and Michael W. Berry (2014). Classification of T cell movement tracks allows for prediction of cell function. I. J. Computational Biology and Drug Design 7(2/3), 113-129, 10.1504/IJCBDD.2014.061655, BibTeX
  31. Deborah Falcone, Cecilia Mascolo, Carmela Comito, Domenico Talia, and Jon Crowcroft (2014). What is this place? Inferring place categories through user patterns identification in geo-tagged tweets. MobiCASE, 10-19, IEEE, 10.4108/icst.mobicase.2014.257683, BibTeX
  32. A. Mehta, and O. Dikshit (2014). SPCA Assisted Correlation Clustering of Hyperspectral Imagery. ISPRS Annals of Photogrammetry, Remote Sensing and Spatial Information Sciences II-8, 111-116, Copernicus GmbH, 10.5194/isprsannals-II-8-111-2014
  33. Mojgan Pourrajabi, Davoud Moulavi, Ricardo J. G. B. Campello, Arthur Zimek, Jörg Sander, and Randy Goebel (2014). Model Selection for Semi-Supervised Clustering. EDBT, 331-342, 10.5441/002/edbt.2014.31, BibTeX
  34. Felix Stahlberg, Tim Schlippe, Stephan Vogel, and Tanja Schultz (2014). Towards automatic speech recognition without pronunciation dictionary, transcribed speech and text resources in the target language using cross-lingual word-to-phoneme alignment. SLTU, 73-80, ISCA, BibTeX
  35. Stephan Günnemann, Hardy Kremer, Matthias Hannen, and Thomas Seidl (2014). KDD-SC: Subspace Clustering Extensions for Knowledge Discovery Frameworks. CoRR abs/1407.3850, BibTeX
  36. Albrecht Zimmermann (2014). A feature construction framework based on outlier detection and discriminative pattern mining. CoRR abs/1407.4668, BibTeX
  37. Xiao He (2014). Multi-purpose exploratory mining of complex data. Ludwig Maximilians University Munich, BibTeX
  38. Richard Röttger (2014). Active transitivity clustering of large-scale biomedical datasets. Saarland University, BibTeX
  39. Michael Davis (2014). Discovering patterns and anomalies in graphs with discrete and numeric attributes. Queen’s University Belfast, UK, BibTeX
  40. Ibrahim Mithgal Aljarah (2014). MapReduce-enabled scalable nature-inspired approaches for clustering. North Dakota State University, 978-1-303-83676-3
  41. Samuel Valentine (2014). Sentiment Analysis 19 Success Secrets - 19 Most Asked Questions On Sentiment Analysis - What You Need To Know. Emereo Publishing, 9781488535208
  42. Adnan Karaibrahimoğlu (2014). Veri madenciliğinden birliktelik kuralı ile onkoloji verilerinin analiz edilmesi: Meram Tıp Fakültesi Onkoloji örneği (Analyzing breast cancer data using association rule mining: Meram Faculty of Medicine Oncology Department). Selçuk Üniversitesi Fen Bilimleri Enstitüsü
  43. Andrea Bagnacani (2014). Linked Data e bibliometriche: un indice di multidisciplinarieta nel Semantic Publishing.
  44. Björn Löfroth (2014). Mobile traffic dataset comparisons through cluster analysis of radio network event sequences.
  45. Dominique Legallois, Solen Quiniou, Peggy Cellier, and Thierry Charnois (2014). Graph Mining under Linguistic Constraints for Exploring Large Texts. Instituto Politécnico Nacional
  46. Haofan Zhang (2014). Spectral Ranking and Unsupervised Feature Selection for Point, Collective and Contextual Anomaly Detection.
  47. Henrik Larsson, and Erik Lindqvist (2014). Unsupervised Outlier Detection in Software Engineering. Institutionen för data- och informationsteknik (Chalmers), Chalmers tekniska högskola
  48. João Luiz Grave Gross (2014). URSA: um framework para agrupamento de dados e validação de resultados (URSA: a framework for data clustering and data analysis).
  49. Muhammad Sohail (2014). Calculation of Energy Footprint of Manufacturing Assets.
  50. Nicola Padovano, and Elia Filiberto Polo (2014). Progetto e realizzazione di un framework per Neosperience sul clustering di reti sociali. Italy
  51. Pratik Kumar Mishra, Dinesh Pothineni, Aadil Rasheed, Deepak Sundararajan, Ashok Krish, Hasit Kaji, and Tata Consultancy Services Limited (2014). System and Method for Determining an Expert of a Subject on a Web-based Platform.
  52. R.J. Ma, and N.Y. Yu (2014). A new route for energy efficiency diagnosis and potential analysis of energy consumption from air-conditioning system. Energy Systems Laboratory (
  53. Reka Katalin Kelemen (2014). Mathematical modeling of T cell clustering following malaria infection in mice. University of Tennessee, Knoxville
  54. Ritesh Shukla (2014). Machine learning ecosystem: implications for business strategy centered on machine learning. Massachusetts Institute of Technology
  55. Robert F Erbacher, and Robinson Pino (2014). Open Source Software Tools for Anomaly Detection Analysis. ARL-MR-0869, Army Research Lab Adelphi MD Computational and Information Sciences Directorate
  56. Sheila Mollá Santiago (2014). Generalització de mètodes de density-based clustering a dades mixtes. Universitat Politècnica de Catalunya
  57. Tim Zwietasch (2014). Detecting anomalies in system log files using machine learning techniques. Uni Stuttgart - Universitätsbibliothek
  58. Tânia Margarida dos Santos Gomes (2014). Ferramentas open source de Data Mining.
  59. V. Ilango (2014). Forecasting Methods Based on Outlier Detection And Influential Point Observation on Clustering Techniques Using Financial Time Series Data. Virudhunagar
  60. Y.P.J.M. van Oirschot (2014). Using Trace Clustering for Configurable Process Discovery Explained by Event Log Data.
  61. И.А. Пестунов, and С.А. Рылов (2014). Метод построения ансамбля сеточных иерархических алгоритмов кластеризации для сегментации спутниковых изображений. Региональные проблемы дистанционного зондирования Земли, 215-223
  62. Казаковцев Лев Александрович, Орлов Виктор Иванович, Ступина Алена Александровна, and Масич Игорь Сергеевич (2014). Задача классификации электронной компонентной базы. Вестник Сибирского государственного университета науки и технологий имени академика М. Ф. Решетнева, Федеральное государственное бюджетное образовательное учреждение высшего образования «Сибирский государственный университет науки и технологий имени академика М.Ф. Решетнева»


  1. Zeyar Aung (2013). Database Systems for the Smart Grid. Smart Grids, 151-168, Springer, 10.1007/978-1-4471-5210-1_7
  2. Charu C. Aggarwal (2013). Outlier Analysis. Springer, 10.1007/978-1-4614-6396-2, BibTeX
  3. Charu C. Aggarwal (2013). Applications of Outlier Analysis. Outlier Analysis, 373-400, Springer, 10.1007/978-1-4614-6396-2_12
  4. Charu C. Aggarwal (2013). High-Dimensional Outlier Detection: The Subspace Method. Outlier Analysis, 135-167, Springer, 10.1007/978-1-4614-6396-2_5
  5. Jordi Nin, David Carrera, and Daniel Villatoro (2013). On the Use of Social Trajectory-Based Clustering Methods for Public Transport Optimization. CitiSens, 59-70, Springer, 10.1007/978-3-319-04178-0_6, BibTeX
  6. Mark J. Embrechts, Christopher J. Gatti, Jonathan Linton, and Badrinath Roysam (2013). Hierarchical Clustering for Large Data Sets. Advances in Intelligent Signal Processing and Data Mining, 197-233, Springer, 10.1007/978-3-642-28696-4_8
  7. Mariusz Oszust, and Marian Wysocki (2013). Clustering and Classification of Time Series Representing Sign Language Words. ICAISC (2), 218-229, Springer, 10.1007/978-3-642-38610-7_21, BibTeX
  8. Rana Momtaz, Nesma Mohssen, and Mohammad A. Gowayyed (2013). DWOF: A Robust Density-Based Outlier Detection Approach. IbPRIA, 517-525, Springer, 10.1007/978-3-642-38628-2_61, BibTeX
  9. Felix Stahlberg, Tim Schlippe, Stephan Vogel, and Tanja Schultz (2013). Pronunciation Extraction from Phoneme Sequences through Cross-Lingual Word-to-Phoneme Alignment. SLSP, 260-272, Springer, 10.1007/978-3-642-39593-2_23, BibTeX
  10. Tobias Emrich, Hans-Peter Kriegel, Peer Kröger, Johannes Niedermayer, Matthias Renz, and Andreas Züfle (2013). Reverse-k-Nearest-Neighbor Join Processing. SSTD, 277-294, Springer, 10.1007/978-3-642-40235-7_16, BibTeX
  11. Erich Schubert, Arthur Zimek, and Hans-Peter Kriegel (2013). Geodetic Distance Queries on R-Trees for Indexing Geographic Data. SSTD, 146-164, Springer, 10.1007/978-3-642-40235-7_9, BibTeX
  12. Jeremy Steinhauer, Lois M. L. Delcambre, Marianne Lykke, and Marit Kristine Ådland (2013). Do User (Browse and Click) Sessions Relate to Their Questions in a Domain-Specific Collection? TPDL, 96-107, Springer, 10.1007/978-3-642-40501-3_10, BibTeX
  13. Enikö Székely, Pascal Poncelet, Florent Masseglia, Maguelonne Teisseire, and Renaud Cezar (2013). A Density-Based Backward Approach to Isolate Rare Events in Large-Scale Applications. Discovery Science, 249-264, Springer, 10.1007/978-3-642-40897-7_17, BibTeX
  14. Xuan-Hong Dang, Barbora Micenková, Ira Assent, and Raymond T. Ng (2013). Local Outlier Detection with Interpretation. ECML/PKDD (3), 304-320, Springer, 10.1007/978-3-642-40994-3_20, BibTeX
  15. Part Pramokchon, and Punpiti Piamsa-nga (2013). An Unsupervised, Fast Correlation-Based Filter for Feature Selection for Data Clustering. DaEng, 87-94, Springer, 10.1007/978-981-4585-18-7_10, BibTeX
  16. Christophe Jardin, Arno G. Stefani, Martin Eberhardt, Johannes B. Huber, and Heinrich Sticht (2013). An information-theoretic classification of amino acids for the assessment of interfaces in protein–protein docking. Journal of Molecular Modeling 19(9), 3901-3910, Springer, 10.1007/s00894-013-1916-7
  17. Kai Ming Ting, Takashi Washio, Jonathan R. Wells, Fei Tony Liu, and Sunil Aryal (2013). DEMass: a new density estimator for big data. Knowl. Inf. Syst. 35(3), 493-524, 10.1007/s10115-013-0612-3, BibTeX
  18. Kai Ming Ting, Guang-Tong Zhou, Fei Tony Liu, and Swee Chuan Tan (2013). Mass estimation. Machine Learning 90(1), 127-160, 10.1007/s10994-012-5303-x, BibTeX
  19. Ibrahim Aljarah, and Simone A. Ludwig (2013). A new clustering approach based on Glowworm Swarm Optimization. IEEE Congress on Evolutionary Computation, 2642-2649, IEEE, 10.1109/CEC.2013.6557888, BibTeX
  20. Yang Zhao, and Abhishek K. Shrivastava (2013). Combating Sub-Clusters Effect in Imbalanced Classification. ICDM, 1295-1300, IEEE, 10.1109/ICDM.2013.105, BibTeX
  21. Barbora Micenková, Raymond T. Ng, Xuan-Hong Dang, and Ira Assent (2013). Explaining Outliers by Subspace Separability. ICDM, 518-527, IEEE, 10.1109/ICDM.2013.132, BibTeX
  22. Arian Bär, Antonio Paciello, and Peter Romirer-Maierhofer (2013). Trapping botnets by DNS failure graphs: Validation, extension and application to a 3G network. INFOCOM, 3159-3164, IEEE, 10.1109/INFCOM.2013.6567131, BibTeX
  23. Arian Bär, Antonio Paciello, and Peter Romirer-Maierhofer (2013). Trapping botnets by DNS failure graphs: Validation, extension and application to a 3G network. INFOCOM Workshops, 393-398, IEEE, 10.1109/INFCOMW.2013.6562863, BibTeX
  24. Amine Chaibi, Mustapha Lebbah, and Hanane Azzag (2013). A New Visualization of Group-Outliers in Unsupervised Learning. IV, 162-167, IEEE, 10.1109/IV.2013.20, BibTeX
  25. Elke Achtert, Hans-Peter Kriegel, Erich Schubert, and Arthur Zimek (2013). Interactive data mining with 3D-parallel-coordinate-trees. SIGMOD Conference, 1009-1012, ACM, 10.1145/2463676.2463696, BibTeX
  26. Arthur Zimek, Matthew Gaudet, Ricardo J. G. B. Campello, and Jörg Sander (2013). Subsampling for efficient and effective unsupervised outlier detection ensembles. KDD, 428-436, ACM, 10.1145/2487575.2487676, BibTeX
  27. Benjamin Welton, Evan Samanas, and Barton P. Miller (2013). Mr. Scan: extreme scale density-based clustering using a tree-based network of GPGPU nodes. SC, 84:1-84:11, ACM, 10.1145/2503210.2503262, BibTeX
  28. Johannes Schneider, and Michail Vlachos (2013). Fast parameterless density-based clustering via random projections. CIKM, 861-866, ACM, 10.1145/2505515.2505590, BibTeX
  29. Toon De Pessemier, Simon Dooms, and Luc Martens (2013). A food recommender for patients in a care facility. RecSys, 209-212, ACM, 10.1145/2507157.2507198, BibTeX
  30. David Ando, Michael Colvin, Michael Rexach, and Ajay Gopinathan (2013). Physical Motif Clustering within Intrinsically Disordered Nucleoporin Sequences Reveals Universal Functional Features. PLoS ONE 8(9), e73831, Public Library of Science (PLoS), 10.1371/journal.pone.0073831
  31. Martin Schäler, Alexander Grebhahn, Reimar Schröter, Sandro Schulze, Veit Köppen, and Gunter Saake (2013). QuEval: Beyond high-dimensional indexing a la carte. PVLDB 6(14), 1654-1665, 10.14778/2556549.2556551, BibTeX
  32. Martin Behnisch, Gotthard Meinel, Sebastian Tramsen, and Markus Diesselmann (2013). Using quadtree representations in building stock visualization and analysis. Erdkunde 67(2), 151-166, Erdkunde, 10.3112/erdkunde.2013.02.04
  33. Jai Prakash Verma, Bankim Patel, and Atul Patel (2013). Web Mining: Opinion and Feedback Analysis for Educational Institutions. International Journal of Computer Applications 84(6), 17-22, Foundation of Computer Science, 10.5120/14579-2800
  34. Charu C. Aggarwal, and Chandan K. Reddy (2013). Educational and Software Resources for Data Clustering. Data Clustering: Algorithms and Applications, 607-616, BibTeX
  35. Arthur Zimek (2013). Clustering High-Dimensional Data. Data Clustering: Algorithms and Applications, 201-230, BibTeX
  36. Tobias Emrich, Peer Kröger, Johannes Niedermayer, Matthias Renz, and Andreas Züfle (2013). A Mutual Pruning Approach for RkNN Join Processing. BTW, 21-35, GI, BibTeX
  37. Tobias Emrich (2013). Coping with distance and location dependencies in spatial, temporal and uncertain data. Ludwig Maximilians University Munich, BibTeX
  38. Daniel Kuntze (2013). Practical algorithms for clustering and modeling large data sets: analysis and improvements. 1-130, University of Paderborn, BibTeX
  39. Erich Schubert (2013). Generalized and efficient outlier detection for spatial, temporal, and high-dimensional data mining. 1-262, Ludwig Maximilians University Munich, BibTeX
  40. Andreas Züfle (2013). Similarity search and mining in uncertain spatial and spatio-temporal databases. 1-397, Ludwig Maximilians University Munich, BibTeX
  41. Matthew Orlinski (2013). Neighbour discovery and distributed spatio-temporal cluster detection in pocket switched networks. University of Manchester, UK, BibTeX
  42. Claire Elizabeth Q (2013). Machine learning analysis of the cultural and cross-cultural aspects of beauty in music. Aberystwyth University, UK, BibTeX
  43. Thomas H. Davenport, and Jinho Kim (2013). Keeping Up with the Quants. Your Guide to Understanding and Using Analytics. Harvard Business Press, 9781422187265
  44. I. Menken (2013). Data Mining Guidance - Real World Application, Templates, Documents, and Examples of the use of Data Mining in the Public Domain. Emereo Publishing, 9781486460458
  45. Albrecht Zimmermann (2013). Feature construction based on class outliers. CW Reports
  46. Bruno Daigle (2013). Méthodes bioinformatiques pour l’évaluation de la classification du virus du papillome humain. Université du Québec à Montréal
  47. Curdin Barandun, Stefan Derungs, and Gino Paulaitis (2013). Mixtape: Analyse und Erstellung Ähnlichkeitsanalyse von Musik anhand einer praktischen Implementation. HSR Hochschule für Technik Rapperswil
  48. Hardy Kremer (2013). Mining and similarity search in temporal databases. Aachen, Techn. Hochsch., Diss., 2013
  49. Jan Vykopal (2013). Flow-based Brute-force Attack Detection in Large and High-speed Networks. Masarykova univerzita
  50. Jan Vykopal (2013). SimFlow - a similarity-based detection of brute-force attacks.
  51. Kai M Ting (2013). Second Generation of Mass Estimation. Monash Univ Churchill (Australia) Gippsland School Of Information Technology
  52. Luiz O. Carvalho, Thatyana F. P. Seraphim, Caetano Traina Júnior, and Enzo Seraphim (2013). ObInject: a NoODMG Persistence and Indexing Framework for Object Injection. Journal of Information and Data Management 4(3), 220
  53. Manish Gupta (2013). Outlier detection for information networks. University of Illinois at Urbana-Champaign
  54. N Ronald (2013). Workers, adventurers, explorers: uncovering activity patterns in Melbourne. Australasian Transport Research Forum (ATRF), 36th, 2013, Brisbane, Queensland, Australia
  55. Solen Quiniou, Peggy Cellier, Thierry Charnois, and Dominique Legallois (2013). Graph Mining under Linguistic Constraints to Explore Large Texts. International Conference on Intelligent Text Processing and Computational Linguistics (CICLing’13)
  56. Stefan Eduard Raposo Alves (2013). Towards improving WEBSOM with multi-word expressions. Faculdade de Ciências e Tecnologia
  57. Vladimír Matejovský (2013). Podpora shlukování webových stránek pomocí link mining.


  1. Arthur Zimek, Erich Schubert, and Hans-Peter Kriegel (2012). A survey on unsupervised outlier detection in high-dimensional numerical data. Statistical Analysis and Data Mining 5(5), 363-387, 10.1002/sam.11161, BibTeX
  2. Hans-Peter Kriegel, Peer Kröger, and Arthur Zimek (2012). Subspace clustering. Wiley Interdisc. Rew.: Data Mining and Knowledge Discovery 2(4), 351-364, 10.1002/widm.1057, BibTeX
  3. Charu C. Aggarwal (2012). An Introduction to Outlier Analysis. Outlier Analysis, 1-40, Springer, 10.1007/978-1-4614-6396-2_1
  4. Dawn E. Holmes, Jeffrey Tweedale, and Lakhmi C. Jain (2012). Data Mining Techniques in Clustering, Association and Classification. Data Mining: Foundations and Intelligent Paradigms, 1-6, Springer, 10.1007/978-3-642-23166-7_1
  5. Philipp Kranen, Hardy Kremer, Timm Jansen, Thomas Seidl, Albert Bifet, Geoff Holmes, Bernhard Pfahringer, and Jesse Read (2012). Stream Data Mining Using the MOA Framework. DASFAA (2), 309-313, Springer, 10.1007/978-3-642-29035-0_27, BibTeX
  6. Ira Assent, Philipp Kranen, Corinna Baldauf, and Thomas Seidl (2012). AnyOut: Anytime Outlier Detection on Streaming Data. DASFAA (1), 228-242, Springer, 10.1007/978-3-642-29038-1_18, BibTeX
  7. eva Kühn, Alexander Marek, Thomas Scheller, Vesna Sesum-Cavic, Michael Vögler, and Stefan Craß (2012). A Space-Based Generic Pattern for Self-Initiative Load Clustering Agents. COORDINATION, 230-244, Springer, 10.1007/978-3-642-30829-1_16, BibTeX
  8. Emmanuel Müller, Fabian Keller, Sebastian Blanc, and Klemens Böhm (2012). OutRules: A Framework for Outlier Descriptions in Multiple Context Spaces. ECML/PKDD (2), 828-832, Springer, 10.1007/978-3-642-33486-3_57, BibTeX
  9. Mohamed Bouguessa (2012). Modeling Outlier Score Distributions. ADMA, 713-725, Springer, 10.1007/978-3-642-35527-1_59, BibTeX
  10. Boris Delibasic, Milan Vukicevic, Milos Jovanovic, Kathrin Kirchner, Johannes Ruhland, and Milija Suknovic (2012). An architecture for component-based design of representative-based clustering algorithms. Data Knowl. Eng. 75, 78-98, 10.1016/j.datak.2012.03.005, BibTeX
  11. Elke Achtert, Sascha Goldhofer, Hans-Peter Kriegel, Erich Schubert, and Arthur Zimek (2012). Evaluation of Clusterings - Metrics and Visual Support. ICDE, 1285-1288, IEEE, 10.1109/ICDE.2012.128, BibTeX
  12. Hans-Peter Kriegel, Peer Kröger, Erich Schubert, and Arthur Zimek (2012). Outlier Detection in Arbitrarily Oriented Subspaces. ICDM, 379-388, IEEE, 10.1109/ICDM.2012.21, BibTeX
  13. Mohamed Bouguessa (2012). A Probabilistic Combination Approach to Improve Outlier Detection. ICTAI, 666-673, IEEE, 10.1109/ICTAI.2012.95, BibTeX
  14. Monalisa Mandal, and Anirban Mukhopadhyay (2012). Identifying most relevant non-redundant gene markers from gene expression data using PSO-based graph-theoretic approach. 2012 2nd IEEE International Conference on Parallel, Distributed and Grid Computing, 374-379, IEEE, 10.1109/PDGC.2012.6449849
  15. Erich Schubert, Remigius Wojdanowski, Arthur Zimek, and Hans-Peter Kriegel (2012). On Evaluation of Outlier Rankings and Outlier Scores. SDM, 1047-1058, SIAM / Omnipress, 10.1137/1.9781611972825.90, BibTeX
  16. Thomas Bernecker, Franz Graf, Hans-Peter Kriegel, Nepomuk Seiler, Christoph Türmer, and Dieter Dill (2012). Knowing: a generic data analysis application. EDBT, 630-633, ACM, 10.1145/2247596.2247683, BibTeX
  17. Stephan Günnemann, Ines Färber, Kittipat Virochsiri, and Thomas Seidl (2012). Subspace correlation clustering: finding locally correlated dimensions in subspace projections of the data. KDD, 352-360, ACM, 10.1145/2339530.2339588, BibTeX
  18. Linda Dib, and Alessandra Carbone (2012). CLAG: an unsupervised non hierarchical clustering algorithm handling biological data. BMC Bioinformatics 13, 194, 10.1186/1471-2105-13-194, BibTeX
  19. Thomas Bernecker (2012). Similarity processing in multi-observation data. 1-253, Ludwig Maximilian University of Munich, Germany, BibTeX
  20. Franz Graf (2012). Data and knowledge engineering for medical image and sensor data. 1-221, Ludwig Maximilian University of Munich, Germany, BibTeX
  21. Steffen Suchandt, and Hartmut Runge (2012). Along-track interferometry using TanDEM-X: First results from marine and land applications. EUSAR 2012; 9th European Conference on Synthetic Aperture Radar, 392-395, VDE, 978-3-8007-3404-7
  22. Bruno Tavares (2012). Sistema de recomendação para plataformas de e-learning. Instituto Politécnico do Porto. Instituto Superior de Engenharia do Porto
  23. E. B. Beuschau (2012). Learning usage behavior based on app feedback.
  24. Francesco Indaco (2012). Hierarchical Clustering Using Level Sets. San Jose State University
  25. Jens Ehlers (2012). Self-Adaptive Performance Monitoring for Component-Based Software Systems. 252, Books on Demand GmbH
  26. Γρηγόριος Αθανασίου (2012). Business plan νέας ηλεκτρονικής επιχείρησης (Δημιουργία-Εφαρμογή). Πανεπιστήμιο Μακεδονίας Οικονομικών και Κοινωνικών Επιστημών
  27. Νικόλαος Δ. Γρίβας (Nikolaos D. Grivas) (2012). Υπολογισμός ισοχρονικών καμπύλων χρονοαπόστασης σε οδικά δίκτυα (Isochrone computation on road networks).


  1. Hans-Peter Kriegel, Peer Kröger, Jörg Sander, and Arthur Zimek (2011). Density-based clustering. Wiley Interdisc. Rew.: Data Mining and Knowledge Discovery 1(3), 231-240, 10.1002/widm.30, BibTeX
  2. Thomas Bernecker, Michael E. Houle, Hans-Peter Kriegel, Peer Kröger, Matthias Renz, Erich Schubert, and Arthur Zimek (2011). Quality of Similarity Rankings in Time Series. SSTD, 422-440, Springer, 10.1007/978-3-642-22922-0_25, BibTeX
  3. Elke Achtert, Ahmed Hettab, Hans-Peter Kriegel, Erich Schubert, and Arthur Zimek (2011). Spatial Outlier Detection: Data, Algorithms, Visualizations. SSTD, 512-516, Springer, 10.1007/978-3-642-22922-0_41, BibTeX
  4. Yong Shi, and Li Zhang (2011). COID: A cluster-outlier iterative detection approach to multi-dimensional data analysis. Knowl. Inf. Syst. 28(3), 709-733, 10.1007/s10115-010-0323-y, BibTeX
  5. Kai Ming Ting, Takashi Washio, Jonathan R. Wells, and Fei Tony Liu (2011). Density Estimation Based on Mass. ICDM, 715-724, IEEE, 10.1109/ICDM.2011.47, BibTeX
  6. Hans-Peter Kriegel, Peer Kröger, Erich Schubert, and Arthur Zimek (2011). Interpreting and Unifying Outlier Scores. SDM, 13-24, SIAM / Omnipress, 10.1137/1.9781611972818.2, BibTeX
  7. Claudia Plant (2011). SONAR: Signal De-mixing for Robust Correlation Clustering. SDM, 319-330, SIAM / Omnipress, 10.1137/1.9781611972818.28, BibTeX
  8. Anca Maria Ivanescu, Thivaharan Albin, Dirk Abel, and Thomas Seidl (2011). Employing correlation clustering for the identification of piecewise affine models. Proceedings of the 2011 workshop on Knowledge discovery, modeling and simulation - KDMS ’11, ACM Press, 10.1145/2023568.2023575
  9. Stephan Günnemann, Hardy Kremer, and Thomas Seidl (2011). An extension of the PMML standard to subspace clustering models. PMML ’11, 48-53, ACM, 10.1145/2023598.2023605
  10. Resat Selbas, Arzu Sencan, and Ecir U. (2011). Data Mining Method For Energy System Aplications. Knowledge-Oriented Applications in Data Mining, InTech, 10.5772/13710
  11. Emmanuel Müller, Ira Assent, Stephan Günnemann, Patrick Gerwert, Matthias Hannen, Timm Jansen, and Thomas Seidl (2011). A Framework for Evaluation and Exploration of Clustering Algorithms in Subspaces of High Dimensional Databases. BTW, 347-366, GI, BibTeX
  12. Hans-Peter Kriegel, Erich Schubert, and Arthur Zimek (2011). Evaluation of Multiple Clustering Solutions. MultiClust@ECML/PKDD, 55-66, BibTeX
  13. Johan Mazel (2011). Unsupervised network anomaly detection. INSA de Toulouse
  14. Ευλάμπιος Αποστολίδης (2011). Συγκριτική μελέτη μεθόδων κατασκευής του R* TREE με όρους αποδοτικότητας για ερωτήματα κοντινότερου γείτονα σε πολυδιάστατους χώρους δεδομένων. Πανεπιστήμιο Μακεδονίας Οικονομικών και Κοινωνικών Επιστημών


  1. Elke Achtert, Hans-Peter Kriegel, Lisa Reichert, Erich Schubert, Remigius Wojdanowski, and Arthur Zimek (2010). Visual Evaluation of Outlier Detection Models. DASFAA (2), 396-399, Springer, 10.1007/978-3-642-12098-5_34, BibTeX
  2. Dominik Benz, Andreas Hotho, Robert Jäschke, Beate Krause, Folke Mitzlaff, Christoph Schmitz, and Gerd Stumme (2010). The social bookmark and publication management system bibsonomy - A platform for evaluating and demonstrating Web 2.0 research. VLDB J. 19(6), 849-875, 10.1007/s00778-010-0208-4, BibTeX
  3. Arik Messerman, Tarik Mustafic, Seyit Ahmet Çamtepe, and Sahin Albayrak (2010). A generic framework and runtime environment for development and evaluation of behavioral biometrics solutions. ISDA, 136-141, IEEE, 10.1109/ISDA.2010.5687276, BibTeX
  4. Bilkis J. Ferdosi, Hugo Buddelmeijer, Scott C. Trager, Michael H. F. Wilkinson, and Jos B. T. M. Roerdink (2010). Finding and visualizing relevant subspaces for clustering high-dimensional astronomical data using connected morphological operators. IEEE VAST, 35-42, IEEE, 10.1109/VAST.2010.5652450, BibTeX
  5. Tobias Emrich, Hans-Peter Kriegel, Peer Kröger, Matthias Renz, and Andreas Züfle (2010). Boosting spatial pruning: on optimal pruning of MBRs. SIGMOD Conference, 39-50, ACM, 10.1145/1807167.1807174, BibTeX
  6. Kai Ming Ting, Guang-Tong Zhou, Fei Tony Liu, and James Swee Chuan Tan (2010). Mass estimation and its applications. KDD, 989-998, ACM, 10.1145/1835804.1835929, BibTeX
  7. Tobias Emrich, Franz Graf, Hans-Peter Kriegel, Matthias Schubert, and Marisa Thoma (2010). On the impact of flash SSDs on spatial indexing. DaMoN, 3-8, ACM, 10.1145/1869389.1869390, BibTeX
  8. Albert Hein, and Thomas Kirste (2010). Unsupervised detection of motion primitives in very high dimensional sensor data. BMI, 22-37


  1. Hans-Peter Kriegel, Peer Kröger, Erich Schubert, and Arthur Zimek (2009). Outlier Detection in Axis-Parallel Subspaces of High Dimensional Data. PAKDD, 831-838, Springer, 10.1007/978-3-642-01307-2_86, BibTeX
  2. Elke Achtert, Thomas Bernecker, Hans-Peter Kriegel, Erich Schubert, and Arthur Zimek (2009). ELKI in Time: ELKI 0.2 for the Performance Evaluation of Distance Measures for Time Series. SSTD, 436-440, Springer, 10.1007/978-3-642-02982-0_35, BibTeX
  3. Gabriela Moise, Arthur Zimek, Peer Kröger, Hans-Peter Kriegel, and Jörg Sander (2009). Subspace and projected clustering: experimental evaluation and analysis. Knowl. Inf. Syst. 21(3), 299-326, 10.1007/s10115-009-0226-y, BibTeX
  4. Hans-Peter Kriegel, Peer Kröger, and Arthur Zimek (2009). Clustering high-dimensional data: A survey on subspace clustering, pattern-based clustering, and correlation clustering. TKDD 3(1), 1:1-1:58, 10.1145/1497577.1497578, BibTeX
  5. Hans-Peter Kriegel, Peer Kröger, Erich Schubert, and Arthur Zimek (2009). LoOP: local outlier probabilities. CIKM, 1649-1652, ACM, 10.1145/1645953.1646195, BibTeX
  6. Arthur Zimek (2009). Correlation clustering. SIGKDD Explorations 11(1), 53-54, 10.1145/1656274.1656286, BibTeX


  1. Elke Achtert, Hans-Peter Kriegel, and Arthur Zimek (2008). ELKI: A Software System for Evaluation of Subspace Clustering Algorithms. SSDBM, 580-585, Springer, 10.1007/978-3-540-69497-7_41, BibTeX

Finding more

Papers that cite a specific ELKI release can be found using Semantic Scholar or Google Scholar:

Release 0.1: Semantic Scholar, Google Scholar

Release 0.2: Semantic Scholar, Google Scholar

Release 0.3: Semantic Scholar, Google Scholar

Release 0.4: Semantic Scholar, Google Scholar

Release 0.5: Semantic Scholar, Google Scholar

Release 0.6: Semantic Scholar, Google Scholar

Release 0.7: Semantic Scholar, Google Scholar