This is an old revision of the document!


Data Management Aspects in IoT

Data management is a critical task in IoT. Due to the high number of devices, things, already available (tens of billions), and considering the data traffic generated by each of them through sensor networks, infotainment (soft news) or surveillance systems, mobile social network clients, and so on, we are now in the ZettaByte (ZB 2^70, 10^21 bytes) era. This opened up several new challenges on (IoT) data management, giving rise to data sciences and BigData technologies. Such challenges have not to be considered as main issues to solve, but also as big opportunities fuelling digital economy with new directions such as Cloudonomics[1] and IoTonomics, where data can be considered as a utility, a commodity to properly manage, curate, store, and trade. Therefore, to properly manage data in IoT contexts is not only critical but also of strategic importance for business players as well as for users, evolving into prosumers (producers-consumers).

From a technological perspective, the main aspects of dealing with IoT data management are:

  • Data source: data generation and production is a relevant part of IoT, involving sensors probing the physical system. In a cyber-physical-social system view, such sensors could also be virtual (e.g. software), or even human (e.g. citizens, crowdsensing). Main issues to deal with in data production are related to the type and format of data, heterogeneity in measurements and similar issues. Semantics is the key to solve these issue, also through specific standards such as Sensor Web Enablement and Semantic Sensor Network[2].
  • Data collection/gathering: once data are generated, these should be gathered and made available for processing. The collection process needs to ensure that the data gathered are both defined and accurate so that subsequent decisions based on the findings are valid. Some types of data collection include census (data collection about everything in a group or statistical population), sample survey (collection method that includes only part of the total population), and administrative by-product (data collection is a byproduct of an organisation’s day-to-day operations). Usually, wireless communication technologies such as Zigbee, BlueTooth, LoRa, Wi-Fi and 3-4G networks are used by IoT smart objects and things to deliver data to collection points.
  • Filtering: is a specific preprocessing activity, usually performed at data source or data collector (IoT) nodes (e.g. motes, base stations, hotspots, gateways), aiming at cleaning noisy data, filtering noise and not useful information.
  • Aggregation/Fusion: in order to reduce bandwidth before sending data to processing nodes, these are further elaborated, compressed, aggregated and fused (sensor/data fusion) to reduce the overall volume of raw data to be transmitted and stored.
  • Processing: once data are properly collected, filtered, aggregated, and fused, they can be processed. Processing can be both local and remote, and usually, also include preprocessing activities aiming at preparing data for real processing. Local processing, when possible, is mainly tasked at a fast, lightweight computation on edges (Edge computing), quickly providing results and local analytics. More complex computation are usually demanded to remote (physical or virtual) servers, either provided by local nodes (e.g. communication servers, cloudlets) in a Fog computing fashion, or by Cloud providers as virtual machines hosted in data centres. This kind of computation can also involve historical data, providing global analytics, but hardly meets time-constrained applications and real-time requirements.
  • Storage/Archive: remote servers are also used for permanently store and archive data, making these available for further processing, even to third parties. The database is often used for that, mainly based on distributed, NoSQL key-store technologies to improve reliability and performance.
  • Delivering/Presentation/Visualization: the results of processing activities have to be then delivered to requestors and users. These have to be therefore properly organised and formatted, ready for end-users. IoT data visualisation is becoming an integral part of the IoT. Data visualisation provides a way to display this avalanche of collected data in meaningful ways that clearly present insights are hidden within this mass amount of information.
  • Security and privacy: data privacy and security are among the most critical issues to address in IoT data management. Good results and reliable techniques for secure data transmission, such a TLS and similar, are available. This way, IoT data security issues mainly concern[3] securing IoT devices, since they are usually resource constrained and therefore do not allow to adopt traditional cryptography scheme to data encryption/decryption. Data privacy and integrity should also be enforced in remote storage servers, anonymising data as well as allowing owners to properly manage (monitoring, removing) them while ensuring availability. Indeed, security and privacy issues vertically span into the whole IoT stack. A promising technique to address IoT security issues, attracting growing interests from both academic and business communities, is blockchain [4].
en/iot-open/introduction/introduction_to_data-related_design_questions_of_iot.1558469567.txt.gz · Last modified: 2020/07/20 09:00 (external edit)
CC Attribution-Share Alike 4.0 International
www.chimeric.de Valid CSS Driven by DokuWiki do yourself a favour and use a real browser - get firefox!! Recent changes RSS feed Valid XHTML 1.0