The fuzzy grouping transformation performs data cleaning tasks by identifying rows of data that are likely to be duplicates and selecting a canonical row of data to use in standardizing the data 模糊分组转换执行数据清理任务,它首先查找可能重复的数据行,然后选择要在对数据进行标准化的过程中使用的规范数据行。
At run time , the fuzzy grouping transformation creates temporary objects such as tables and indexes , potentially of significant size , in the sql server 2005 database that the transformation connects to 在运行时,模糊分组转换会在该转换所连接到的sql server 2005数据库中创建临时对象,例如表和索引,这些表和索引可能会非常大。
Traditional grouping method could not use all attributes of entity and fuzzy correlation space based approach needed more match computation . fuzzy grouping method added two new techniques based on the fuzzy correlation space approach : fuzzy consistent relation based weight distribution and grid based preprocessing 由于传统的分组方法不能充分利用实体所有的属性,而基于模糊关联空间的分组方法需要过多的运算量,本文提出一种模糊分组方法,该分组方法在基于模糊关联空间方法的基础上加入基于模糊一致性的权值分配方法和基于格子的预处理分组方法。