Este artigo é uma revisão interna (rascuho) – em inglês – feita no início da minha dissertação visando levantar o estado da arte da similaridade entre dados semi-estruturados.
Research about similarity between semi-structured documents (particularly for XML documents) has produced many works in the areas of Database Systems, Artificial Intelligence and Data Mining. In this work we introduce a brief survey about it. At first we introduce some basic concepts like similarity types and algorithms. After that, some works are reviewed, highlighting their particularities and general approach. We conclude with a comparison of these works, analysing their benefits and problems.