Arvid Heise
Verified email at hpi.de
Title
Cited by
Year
The stratosphere platform for big data analytics
A Alexandrov, R Bergmann, S Ewen, JC Freytag, F Hueske, A Heise, ...
The VLDB Journal 23 (6), 939-964, 2014
Cited by: 502; Year: 2014
Progressive duplicate detection
T Papenbrock, A Heise, F Naumann
IEEE Transactions on knowledge and data engineering 27 (5), 1316-1329, 2014
Cited by: 104; Year: 2014
Scalable discovery of unique column combinations
A Heise, JA Quiané-Ruiz, Z Abedjan, A Jentzsch, F Naumann
Proceedings of the VLDB Endowment 7 (4), 301-312, 2013
Cited by: 79; Year: 2013
GovWILD: integrating open government data for transparency
C Böhm, M Freitag, A Heise, C Lehmann, A Mascher, F Naumann, ...
Proceedings of the 21st International Conference on World Wide Web, 321-324, 2012
Cited by: 57; Year: 2012
Integrating open government data with stratosphere for more transparency
A Heise, F Naumann
Journal of Web Semantics 14, 45-56, 2012
Cited by: 48; Year: 2012
Meteor/sopremo: An extensible query language and operator model
A Heise, A Rheinländer, M Leich, U Leser, F Naumann
Workshop on End-to-end Management of Big Data, Istanbul, Turkey, 2012
Cited by: 48; Year: 2012
SOFA: An extensible logical optimizer for UDF-heavy data flows
A Rheinländer, A Heise, F Hueske, U Leser, F Naumann
Information Systems 52, 96-125, 2015
Cited by: 35; Year: 2015
The SOM family: virtual machines for teaching and research
M Haupt, R Hirschfeld, T Pape, G Gabrysiak, S Marr, A Bergmann, ...
Proceedings of the fifteenth annual conference on Innovation and technology …, 2010
Cited by: 22; Year: 2010
Estimating the number and sizes of fuzzy-duplicate clusters
A Heise, G Kasneci, F Naumann
Proceedings of the 23rd ACM International Conference on Conference on …, 2014
Cited by: 14; Year: 2014
Applying stratosphere for big data analytics
M Leich, J Adamek, M Schubotz, A Heise, A Rheinländer, V Markl
Datenbanksysteme für Business, Technologie und Web (BTW) 2046, 2013
Cited by: 14; Year: 2013
Reach for gold: An annealing standard to evaluate duplicate detection results
T Vogel, A Heise, U Draisbach, D Lange, F Naumann
Journal of Data and Information Quality (JDIQ) 5 (1-2), 1-25, 2014
Cited by: 11; Year: 2014
Astrid Rheinländer, Matthias J. Sax, Sebastian Schelter, Mareike Höger, Kostas Tzoumas, and Daniel Warneke. 2014. The stratosphere platform for big data analytics
A Alexandrov, R Bergmann, S Ewen, JC Freytag, F Hueske, A Heise, ...
The International Journal on Very Large Data Bases (VLDBJ) 23 (6), 939-964, 2014
Cited by: 10; Year: 2014
Versatile optimization of UDF-heavy data flows with sofa
A Rheinländer, M Beckmann, A Kunkel, A Heise, T Stoltmann, U Leser
Proceedings of the 2014 ACM SIGMOD International Conference on Management of …, 2014
Cited by: 5; Year: 2014
SOFA: an extensible logical optimizer for udf-heavy dataflows
A Rheinländer, A Heise, F Hueske, U Leser, F Naumann
arXiv preprint arXiv:1311.6335, 2013
Cited by: 5; Year: 2013
Data cleansing and integration operators for a parallel data analytics platform
A Heise
Cited by: 1; Year: 2014
Security Management Platform: Documentation
F Goerke, A Heise, D Jaeger, C Pöpke, S Reichel
Network Security in Practice-Winter Term 9 (10), 0
Cited by: 1
Method and system to discover dependencies in datasets
JAQ Ruiz, F Naumann, A Heise
US Patent App. 14/894,507, 2016
Year: 2016
Large-Scale Duplicate Detection
F Naumann, A Heise
Graph Twiddling in a MapReduce World Jonathan Cohen
A Heise, M Leben
Type of Lecture
J Dyck, E Kny, D Beyer, F Thomas, M Grauer, S Viehmeier, T Grütze, ...