Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revisionPrevious revision
en:iot-reloaded:clustering_models [2024/12/10 21:34] pczekalskien:iot-reloaded:clustering_models [2024/12/10 21:34] (current) pczekalski
Line 22: Line 22:
 ==== Data preprocessing before clustering ==== ==== Data preprocessing before clustering ====
  
-Before starting clustering, several important steps have to be performed:+Before starting clustering, several necessary steps have to be performed:
  
   * **Check if the used data is metric:** In clustering, the primary measure is Euclidian distance (in most cases), which requires numeric data. While it is possible to encode some arbitrary data using numerical values, they must maintain the semantics of numbers, i.e. 1 < 2 < 3. Good examples of natural metric data are temperature, exam assessments, and the like—bad examples are gender and colour.   * **Check if the used data is metric:** In clustering, the primary measure is Euclidian distance (in most cases), which requires numeric data. While it is possible to encode some arbitrary data using numerical values, they must maintain the semantics of numbers, i.e. 1 < 2 < 3. Good examples of natural metric data are temperature, exam assessments, and the like—bad examples are gender and colour.
en/iot-reloaded/clustering_models.1733866446.txt.gz · Last modified: 2024/12/10 21:34 by pczekalski
CC Attribution-Share Alike 4.0 International
www.chimeric.de Valid CSS Driven by DokuWiki do yourself a favour and use a real browser - get firefox!! Recent changes RSS feed Valid XHTML 1.0