Bibliography

[Anan03]
R. Ananthakrishna, S. Chaudhuri, V. Ganti (2003): Eliminating Fuzzy Duplicates in Data Warehouses. Hong Kong.
[Balouch03]
A. Balouch, A. Heuer, H. Meyer (2003): Schema Integration und XQuery Core in Digitalen Bibliotheken. Chair of Database and Information Systems, Rostock.
[Bhattdmkd04]
Indrajit Bhattacharya, Lise Getoor (2004): Iterative Record Linkage for Cleaning and Integration. Maryland.
[Bilen03]
Mikhail Bilenko, Raymond J. Mooney: Adaptive Duplicate Detection Using Learnable String Similarity Measures.
[Brockhaus99]
Brockhaus (1999): Brockhaus. Die Enzyklopädie in 24 Bänden. Bibliographisches Institut & F. A. Brockhaus AG, Mannheim.
[Cai05]
Rita Sharma, David Poole: Probability and Equality: A Probabilistic Model of Identity Uncertainty. British Columbia.
[Callumnigamco]
A. McCallum, K. Nigam, J. Rennie, K. Seymore (1999): A Machine Learning Approach to Building Domain-Specific Search Engines. Pittsburgh.
[Callumnigamc2]
A. McCallum, K. Nigam, J. Rennie, K. Seymore (2000): Automating the Construction of Internet Portals with Machine Learning. Pittsburgh.
[CDay]
C. Day (1997): A Checklist for Evaluating Record Linkage Software.
[CIAC2000]
Yusuke Shibata, Takuya Kida, Shuichi Fukamachi et al. (2000): Speeding Up Pattern Matching by Text Compression. Kyushu University 33, Dept. of Informatics, Fukuoka 812-8581.
[Cohen03]
W. W. Cohen, P. Ravikumar, St. E. Fienberg (2003): A Comparison of String Distance Metrics for Name-Matching Tasks. American Association for Artificial Intelligence.
[Cupid01]
E. Rahm, P. A. Bernstein, J. Madhavan (2001): Generic Schema Matching with Cupid. Proceedings of the 27th VLDB Conference.
[Doan01]
AnHai Doan, Ying Lu, Yoonkyong Lee, Jiawei Han (2001): Object Matching for Information Integration: A Profiler-Based Approach. Illinois.
[DuplDet06]
Tobias Ackermann, Florian Weigold (2006): Duplicate Detection using Machine Learning.
[Febrl05]
Peter Christen, Tim Churches (2003): Febrl - freely extensible biomedical record linkage. Technical Manual.
[Fellegi69]
I. P. Fellegi, A. B. Sunter (1969): A Theory for Record Linkage. Journal of the American Statistical Association, Washington.
[Ferber03]
Reginald Ferber (2003): Information Retrieval - Suchmodelle und Data-Mining. dpunkt Verlag, Heidelberg.
[Frakes04]
W. B. Frakes, R. Baeza-Yates (2004): Information Retrieval: Data Structures & Algorithms.
[Gamma94]
Erich Gamma, Richard Helm, Ralph Johnson, John Vlissides (1994): Design Patterns - Elements of Reusable Object-Oriented Software. Addison-Wesley, Pearson, 201 W. 103rd Street, Indianapolis, IN 46290.
[Garcia05]
Hector Garcia-Molina (2005): Handling Data Quality in Entity Resolution.
[Giunchiglia04]
F. Giunchiglia, P. Shvaiko (2004): Semantic Matching. University of Trento, Italy.
[glue]
Matthias Kerzel (2003): GLUE - Learning to map between ontologies on the semantic web.
[Gravano03]
L. Gravano, P. G. Ipeirotis, N. Koudas, D. Srivastava (2003): Text Joins in an RDBMS for Web Data Integration. Columbia University.
[Haskell98]
P. Hudak, J. Peterson, J. H. Fasel (1999): A Gentle Introduction to Haskell 98. Yale University, Department of Computer Science.
[Hjelm01]
Johan Hjelm (2001): Creating the semantic Web with RDF. Wiley, New York.
[HS95]
M. Hernandez, S. Stolfo (1995): The merge/purge problem for large databases.
[Inte03]
M. Bilenko, R. Mooney, W. Cohen, P. Ravikumar, St. Fienberg (2003): Adaptive Name Matching in Information Integration. Information Integration on the Web.
[Ishkur06]
Kenneth Taylor (2006): Ishkur's Guide to Electronic Music. www.di.fm/edmguide/edmguide.html
[JSM2002]
Marco Fortini, Alessandra Nuccitelli, Brunero Liseo, Mauro Scanu (2002): Modelling Issues in Record Linkage: a Bayesian Perspective. Rome.
[Kalfoglou05]
Y. Kalfoglou, M. Schorlemmer (2005): Ontology Mapping: The State of the Art. Dagstuhl Seminar Proceedings.
[Kdd03]
Rohan Baxter, Peter Christen, Tim Churches (2003): A Comparison of Fast Blocking Methods for Record Linkage. Canberra.
[Keogh04]
Eamonn Keogh, Stefano Lonardi, Chotirat Ann Ratanamahatana (2004): Towards Parameter-Free Data Mining.
[Klein]
Michel Klein (2001): Combining and relating ontologies - an analysis of problems and solutions. Vrije Universiteit Amsterdam.
[Kordon05]
Dr.-Ing. Ullrich Kordon (2005): Vorlesung Sprachsynthese. TU Dresden.
[Lev06]
Die Levenshtein-Distanz: www.levenshtein.de
[Meh05]
Dipl.-Wi.-Ing. Marc Ehrig (2005): Ontology Alignment: Bridging the semantical gap.
[Mehr04]
M. Ehrig, P. Haase, M. Hefke, N. Stojanovic (2004): Similarity for Ontologies - a Comprehensive Framework. Institute AIFB, University of Karlsruhe.
[Monge97]
A. E. Monge, C. P. Elkan (1997): An efficient domain-independent algorithm for detecting approximately duplicate database records. University of California, San Diego.
[Mooney03]
Mikhail Bilenko, Raymond J. Mooney (2003): On Evaluation and Training-Set Construction for Duplicate Detection
[NahmBilenk02]
Un Yong Nahm, Mikhail Bilenko, Raymond J. Mooney (2002): Two Approaches to Handling Noisy Variation in Text Mining.
[Niermann03]
Andrew Nierman, H. V. Jagadish (2003): ProTDB: Probabilistic Data in XML. University of Michigan.
[Nigam01]
Kamal Paul Nigam (2001): Using Unlabeled Data to Improve Text Classification.
[Nigam2000]
Andrew McCallum, Kamal Nigam, Lyle H. Ungar: Efficient Clustering of High-Dimensional Data Sets with Application to Refer
[Noy05]
Natasha Noy (2005): Ontology Mapping and Alignment.
[OLA04]
J. Euzenat, D. Loup, M. Touzani, P. Valtchev (2004): Ontology alignment with OLA. University of Montréal.
[onto06]
Wikipedia (April 2006): Ontologie (Informatik).
[openthesaur06]
Daniel Naber (2006): OpenThesaurus - Deutscher Thesaurus. www.openthesaurus.de
[Portnoy00]
Leonid Portnoy (2000): Intrusion detection with unlabeled data using clustering. Columbia.
[Rahm01]
E. Rahm, P. A. Bernstein (2001): A survey of approaches to automatic schema matching.
[Rahm02]
H.-H. Do, S. Melnik, E. Rahm (2002): Comparison of Schema Matching Evaluations. University of Leipzig.
[Rahm04]
Erhard Rahm, Hong Hai Do: Data Cleaning: Problems and Current Approaches. University of Leipzig, Germany.
[RistadYanilo96]
Eric Sven Ristad, Peter N. Yianilos: Learning String Edit Distance. Princeton University.
[Schaffert04]
S. Schaffert, F. Bry (2004): Querying the Web Reconsidered.
[Schefels06]
Clemens Schefels (2006): Serving Xcerpt to the Web.
[Schreiber06]
M. Schreiber (2006): Neighbourhood-conscious Record Linkage.
[Secstring06]
William W. Cohen, Pradeep Ravikumar, Stephen Fienberg (2006): SecondString. secondstring.sourceforge.net
[SF06]
(2006): SourceForge. www.sourceforge.net
[Simmetric06]
Sam Chapman: Sam's String Metrics. http://www.dcs.shef.ac.uk/~sam/stringmetrics.html
[Soundex00]
The U.S. National Archives and Records Administration (2000): The Soundex Indexing System. www.archives.gov/genealogy/census/soundex
[Spire02]
I. Bartolini, P. Ciaccia, M. Patella (2002): String Matching with Metric Trees Using an Approximate Distance. University of Bologna, Italy.
[Stoilos05]
Giorgos Stoilos, Giorgos Stamou, Stefanos Kollias (2005): A String Metric for Ontology Alignment. Springer-Verlag, Berlin Heidelberg.
[Studer05]
R. Studer, M. Ehrig, Y. Sure (2005): Automatische Wissensintegration mit Ontologien. Institute AIFB, University of Karlsruhe.
[Sven06]
Hans Eric Svensson (2004): Extending Xcerpt with Ontology Queries.
[Tailor02]
M. G. Elfeky, V. S. Verykios, A. K. Elmagarmid (2002): TAILOR: A Record Linkage Toolbox.
[vanReijsenn75]
Cornelis Joost van Rijsbergen (1975): Information retrieval. Butterworths, London (UK).
[Vldb01]
Luis Gravano, Panagiotis G. Ipeirotis, H. V. Jagadish, Nick Koudas (2001): Approximate String Joins in a Database (Almost) for Free. AT&T, Columbia University.
[weka06]
Dr. Eibe Frank et al. (2006): Weka 3: Data Mining Software in Java. www.cs.waikato.ac.nz/~ml/
[Wiki06]
(2006): Precision and Recall. de.wikipedia.org/wiki/Precision
[wikisound]
Wikipedia (2006): Soundex. de.wikipedia.org/wiki/Soundex
[Wilk06]
H. E. Svensson, E. Wilk (2006): XML Querying Using Ontological Information. Linköping University, Sweden.
[Winkler00]
William E. Winkler (2000): Frequency-Based Matching in Fellegi-Sunter Model of. Washington.
[Winkler03]
William E. Winkler (2003): Data Cleaning Methods. Washington.
[Winkler90]
W. E. Winkler (1990): An Application of the FELLEGI-SUNTER Model of Reco. Washington.
[Winkler91]
William E. Winkler: Record Linkage Software and Methods for Merging A. Washington.
[Winkler93]
William E. Winkler (1993): Matching and Record Linkage. Washington.
[Winkler99]
W. E. Winkler (1999): The State of Record Linkage and Current Research Problems. U.S. Bureau of the Census.
[wordnet06]
George A. Miller (2006): WordNet - a lexical database for the English language. wordnet.princeton.edu
[Wu03]
Tianhao Wu (2003): Theory and Algorithms for Information Extraction and Classification in Text. CSE Department, Lehigh University.
[Xcerpt04]
S. Schaffert (2004): Xcerpt: A Rule-Based Query and Transformation Language for the Web.
[XcerptFact05]
S. Berger, F. Bry, T. Furche, B. Linse, S. Schaffert (2005): The Web and semantic web query language Xcerpt.
[XcerptUseCase04]
Sebastian Kraus (2004): Use Cases für Xcerpt - eine positionelle Anfrage und Transformationssprache für das Web
[Zhu01]
J. Joanne Zhu, Lyle H. Ungar: String Edit Analysis for Merging Databases. Philadelphia.