Discovery

共同研究先：University of GroningenAcademic 共同研究数 7

Conference Paper　2016 1 1　

Benchmark datasets for fault detection and classification in sensor data（Last author）

センサーデータにおける故障検出・分類のためのベンチマークデータセット

Bas De Bruijn, Tuan Anh Nguyen, Doina Bucur, Kenji Tei
【抄録】Data measured and collected from embedded sensors often contains faults, i.e., data points which are not an accurate representation of the physical phenomenon monitored by the sensor. These data faults may be caused by deployment conditions outside the operational bounds for the node, and short- or long-term hardware, software, or communication problems. On the other hand, the applications will expect accurate sensor data, and recent literature proposes algorithmic solutions for the fault detection and classification in sensor data. In order to evaluate the performance of such solutions, however, the field lacks a set of benchmark sensor datasets. A benchmark dataset ideally satisfies the following criteria: (a) it is based on real-world raw sensor data from various types of sensor deployments; (b) it contains (natural or artificially injected) faulty data points reflecting various problems in the deployment, including missing data points; and (c) all data points are annotated with the ground truth, i.e., whether or not the data point is accurate, and, if faulty, the type of fault. We prepare and publish three such benchmark datasets, together with the algorithmic methods used to create them: A dataset of 280 temperature and light subsets of data from 10 indoor Intel Lab sensors, a dataset of 140 subsets of outdoor temperature data from SensorScope sensors, and a dataset of 224 subsets of outdoor temperature data from 16 Smart Santander sensors. The three benchmark datasets total 5.783.504 data points, containing injected data faults of the following types known from the literature: random, malfunction, bias, drift, polynomial drift, and combinations. We present algorithmic procedures and a software tool for preparing further such benchmark datasets. © Copyright 2016 by SCITEPRESS- Science and Technology Publications, Lda. All rights reserved.
【抄録日本語訳】組込みセンサーから測定・収集されたデータには、センサーが監視している物理現象を正確に表現していないデータポイント、すなわちフォルトが含まれていることが多い。このようなデータの欠陥は、ノードの動作範囲外の配置条件や、短期的または長期的なハードウェア、ソフトウェア、通信の問題によって引き起こされることがあります。一方、アプリケーションは正確なセンサデータを期待しており、最近の文献ではセンサデータの欠陥検出と分類のためのアルゴリズム的なソリューションが提案されている。しかし、そのような解決策の性能を評価するために、この分野ではベンチマークとなるセンサーデータセットが不足している。ベンチマークデータセットは以下の基準を満たすことが理想的である．(a)様々な種類のセンサデプロイメントから得られた実世界の生センサデータに基づいていること、(b)欠損データポイントを含む、デプロイメントの様々な問題を反映した（自然または人為的に注入した）欠陥データポイントを含んでいること、(c)すべてのデータポイントにグランドトゥルース、つまり、データポイントが正確かどうか、欠陥の場合は、障害の種類を注釈していること、である。我々はこのようなベンチマークデータセットを3つ用意し、その作成に使用したアルゴリズム手法とともに公開する。10個の屋内インテルラボセンサーから得た280個の温度と光のサブセットのデータセット、SensorScopeセンサーから得た140個の屋外温度データのサブセットのデータセット、16個のSmart Santanderセンサーから得た224個の屋外温度データのサブセットのデータセットである。3つのベンチマークデータセットは合計5.783.504データポイントで、文献で知られている次のタイプの注入されたデータフォルトを含んでいる：ランダム、誤動作、バイアス、ドリフト、多項式ドリフト、および組み合わせ。このようなベンチマークデータセットをさらに準備するためのアルゴリズム手順とソフトウェアツールを紹介する。© Copyright 2016 by SCITEPRESS- Science and Technology Publications, Lda. 無断転載を禁じます。

Conference Paper　2015 9 14　IEEE : Institute of Electrical and Electronics Engineers

A self-healing framework for online sensor data（Last author）

オンラインセンサデータ用自己修復フレームワーク

Tuan Anh Nguyen, Marco Aiello, Takuro Yonezawa, Kenji Tei
【抄録】In pervasive computing environments, wireless sensor networks (WSNs) play an important role, collecting reliable and accurate context information so that applications are able to provide services to users on demand. In such environments, sensors should be self-adaptive by taking correct decisions based on sensed data in real-time. However, sensor data is often faulty. Faults are not so exceptional and in most deployments tend to occur frequently. Therefore, the capability of self-healing is important to ensure higher levels of reliability and availability. We design a framework which provides self-healing capabilities, enabling a flexible choice of components for detection, classification, and correction of faults at runtime. Within our framework, a variety of fault detection and classification algorithms can be applied, depending on the characteristics of the sensor data types as well as the topology of the sensor networks. A set of mechanisms for each and every step of the self-healing framework, covering detection, classification, and correction of faults are proposed. To validate the applicability, we illustrate a case study where our solution is implemented as an adaptation engine and integrated seamlessly into the ClouT system. The engine processes data coming from physical sensors deployed in Santander, Spain, providing corrected sensor data to other smart city applications developed in the ClouT project. © 2015 IEEE.
【抄録日本語訳】パーベイシブコンピューティング環境では、無線センサーネットワーク（WSN）が重要な役割を果たし、アプリケーションが要求に応じてユーザーにサービスを提供できるように、信頼性が高く正確なコンテキスト情報を収集する。このような環境では、センサーはリアルタイムで感知したデータに基づいて正しい判断を下すことで、自己適応的になる必要があります。しかし、センサーのデータはしばしば欠陥がある。欠陥はそれほど例外的なものではなく、ほとんどのデプロイメントにおいて、頻繁に発生する傾向がある。したがって、より高いレベルの信頼性と可用性を確保するためには、自己修復機能が重要である。我々は、自己修復機能を提供するフレームワークを設計し、実行時に故障を検出、分類、修正するためのコンポーネントを柔軟に選択できるようにする。このフレームワークでは、センサデータの種類やセンサネットワークのトポロジーの特徴に応じて、様々な障害検出・分類アルゴリズムを適用することができる。また、故障の検出、分類、修正を含む自己修復フレームワークの各ステップのための一連のメカニズムが提案されている。適用性を検証するために、我々のソリューションが適応エンジンとして実装され、ClouTシステムにシームレスに統合されたケーススタディを紹介する。このエンジンは、スペインのサンタンデールに設置された物理センサーからのデータを処理し、ClouTプロジェクトで開発された他のスマートシティアプリケーションに補正されたセンサーデータを提供しています。© 2015 IEEE.

Conference Paper　2013 12 1　IEEE : Institute of Electrical and Electronics Engineers

Fault detection in wireless sensor networks: A machine learning approach（Last author）

ワイヤレスセンサネットワークにおける故障検出: 機械学習によるアプローチ

Ehsan Ullah Warriach, Kenji Tei
【抄録】Wireless Sensor Network (WSN) deployment experiences show that collected data is prone to be faulty. Faults are due to internal and external influences, such as calibration, low battery, environmental interference and sensor aging. However, only few solutions exist to deal with faulty sensory data in WSN. We develop a statistical approach to detect and identify faults in a WSN. In particular, we focus on the identification and classification of data and system fault types as it is essential to perform accurate recovery actions. Our method uses Hidden Markov Models (HMMs) to capture the fault-free dynamics of an environment and dynamics of faulty data. It then performs a structural analysis of these HMMs to determine the type of data and system faults affecting sensor measurements. The approach is validated using real data obtained from over one month of samples from motes deployed in an actual living lab. © 2013 IEEE.
【抄録日本語訳】ワイヤレスセンサネットワーク（WSN）の導入経験では、収集したデータに不具合が発生しやすいことが分かっています。欠陥は、校正、バッテリー低下、環境干渉、センサーの老化など、内部および外部の影響に起因します。しかし、WSNにおける欠陥のあるセンシングデータに対処するためのソリューションはほとんど存在しない。我々は、WSNにおける欠陥の検出と同定のための統計的アプローチを開発する。特に、データおよびシステム障害のタイプの識別と分類は、正確な回復アクションを実行するために不可欠であるため、我々はこれに焦点を当てる。我々の手法は、隠れマルコフモデル（HMM）を用いて、環境の故障のないダイナミクスと故障したデータのダイナミクスを捉える。そして、これらのHMMの構造解析を行い、センサーの測定値に影響を与えるデータとシステムの欠陥の種類を特定する。このアプローチは、実際の生活研究室に配備されたモートから1ヶ月以上のサンプルで得られた実データを用いて検証される。© 2013 IEEE.

Conference Paper　2013 12 1　ACM:Association for Computing Machinery

Applying time series analysis and neighbourhood voting in a decentralised approach for fault detection and classification in WSNs（Last author）

WSNにおける故障検出と分類のための分散型アプローチにおける時系列解析と近傍投票の適用

Tuan Anh Nguyen, Doina Bucur, Marco Aiello, Kenji Tei
ACM International Conference Proceeding Series
【抄録】In pervasive computing environments, wireless sensor networks play an important infrastructure role, collecting reliable and accurate context information so that applications are able to provide services to users on demand. In such environments, sensors should be self-adaptive by taking correct decisions based on sensed data in real-time in a decentralised manner; however, sensed data is often faulty. We thus design a decentralised scheme for fault detection and classification in sensor data in which each sensor node does localised fault detection. A combination of neighbourhood voting and time series data analysis techniques are used to detect faults. We also study the comparative accuracy of both the union and the intersection of the two techniques. Then, detected faults are classified into known fault categories. An initial evaluation with SensorScope, an outdoor temperature dataset, confirms that our solution is able to detect and classify faulty readings into four fault types, namely, 1) random, 2) malfunction, 3) bias, and 4) drift with accuracy up to 95%. The results also show that, with the experimental dataset, the time series data analysis technique performs comparable well in most of the cases, whilst in some other cases the support from neighbourhood voting technique and histogram analysis helps our hybrid solution to successfully detects the faults of all types. Copyright © 2013 ACM.
【抄録日本語訳】パーベイシブコンピューティング環境では、無線センサーネットワークが重要なインフラとしての役割を果たし、アプリケーションが必要に応じてユーザーにサービスを提供できるように、信頼性が高く正確なコンテキスト情報を収集します。このような環境では、センサーは分散型にリアルタイムでセンシングデータに基づく正しい決定を行うことで自己適応的になるべきであるが、センシングデータはしばしば欠陥がある。そこで我々は、各センサノードが局所的に故障を検出する、センサデータの故障検出・分類のための分散化方式を設計する。故障の検出には，近傍投票と時系列データ解析の組み合わせが用いられる．また、2つの手法の和と交の両方の比較精度を研究する。そして、検出された故障は既知の故障カテゴリに分類される。外気温のデータセットであるSensorScopeを用いた初期評価では、我々のソリューションが1）ランダム、2）誤動作、3）バイアス、4）ドリフトの4種類の故障を検出し分類できることが、最大95％の精度で確認された。また、実験データセットでは、ほとんどのケースで時系列データ解析技術が同等の性能を示し、他のケースでは、近傍投票技術やヒストグラム解析のサポートにより、我々のハイブリッドソリューションがすべてのタイプの故障をうまく検出できることが示された。著作権 © 2013 ACM.

Conference Paper　2012 5 8　ACM:Association for Computing Machinery

Fault detection in wireless sensor networks: A hybrid approach

ワイヤレスセンサネットワークにおける障害検出。ハイブリッドアプローチ

Ehsan Warriach, Tuan Anh Nguyen, Kenji Tei, Marco Aiello
【抄録】Wireless Sensor Network (WSN) deployment experiences show that data collected is prone to be imprecise and faulty due to internal and external influences, such as battery drain, environmental interference, sensor aging. An early detection of such faults is necessary for the effective operation of the sensor network. In this preliminary work, we propose a hybrid approach to the detection of faults and we illustrate its performance on data coming from a real sensor deployment. The proposal is a first step to have a hybrid method towards automated on-line fault detection and classification in context-aware WSNs middleware framework. © 2012 ACM.
【抄録日本語訳】ワイヤレスセンサネットワーク（WSN）の導入経験では、バッテリーの消耗、環境干渉、センサーの老朽化など、内部および外部の影響により、収集したデータが不正確になりがちであることが示されています。このような不具合を早期に発見することが、センサネットワークの効果的な運用に必要である。この予備的研究において、我々は障害検出のためのハイブリッドアプローチを提案し、実際のセンサー配置から得られたデータでその性能を説明する。この提案は、コンテキストアウェアWSNsミドルウェアフレームワークにおける自動オンライン故障検出と分類に向けたハイブリッド手法の最初のステップである。© 2012 ACM.

Conference Paper　2012 12 1　IEEE : Institute of Electrical and Electronics Engineers

A machine learning approach for identifying and classifying faults in wireless sensor networks（Last author）

ワイヤレスセンサネットワークにおける故障の特定と分類のための機械学習アプローチ

Ehsan Ullah Warriach, Marco Aiello, Kenji Tei
【抄録】Wireless Sensor Network (WSN) deployment experiences show that collected data is prone to be faulty. Faults are due to internal and external influences, such as calibration, low battery, environmental interference and sensor aging. However, only few solutions exist to deal with faulty sensory data in WSN. We develop a statistical approach to detect and identify faults in a WSN. In particular, we focus on the identification and classification of data and system fault types as it is essential to perform accurate recovery actions. Our method uses Hidden Markov Models (HMMs) to capture the fault-free dynamics of an environment and dynamics of faulty data. It then performs a structural analysis of these HMMs to determine the type of data and system faults affecting sensor measurements. The approach is validated using real data obtained from over one month of samples from motes deployed in an actual living lab. © 2012 IEEE.
【抄録日本語訳】ワイヤレスセンサネットワーク（WSN）の導入経験では、収集したデータに不具合が発生しやすいことが分かっています。欠陥は、校正、電池切れ、環境干渉、センサーの老化など、内部および外部の影響によるものです。しかし、WSNにおける欠陥のあるセンシングデータに対処するためのソリューションはほとんど存在しない。我々は、WSNにおける欠陥の検出と同定のための統計的アプローチを開発する。特に、データおよびシステム障害のタイプの識別と分類は、正確な回復アクションを実行するために不可欠であるため、我々はこれに焦点を当てる。我々の手法は、隠れマルコフモデル（HMM）を用いて、環境の故障のないダイナミクスと故障したデータのダイナミクスを捉える。そして、これらのHMMの構造解析を行い、センサーの測定値に影響を与えるデータとシステムの欠陥の種類を特定する。このアプローチは、実際の生活研究室に配備されたモートから1ヶ月以上のサンプルで得られた実データを用いて検証されている。© 2012 IEEE.

Conference Paper　2012 12 1　IEEE : Institute of Electrical and Electronics Engineers

A hybrid fault detection approach for context-aware wireless sensor networks（Last author）

コンテキストを考慮したワイヤレスセンサネットワークのためのハイブリッド故障検出アプローチ

Ehsan Ullah Warriach, Tuan Anh Nguyen, Marco Aiello, Kenji Tei
【抄録】Wireless Sensor Network (WSN) deployment experiences show that data collected is prone to be imprecise and faulty due to internal and external influences, such as battery drain, environmental interference, sensor aging. An early detection of such faults is necessary for the effective operation of the sensor network. We focus on identifying data fault types and their causes. In particular, we propose a hybrid approach to the detection of faults based on three qualitatively different classes of fault detection methods. Rule-based methods leverage domain and expert knowledge to develop heuristic rules for identifying and classifying faults. Estimation methods predict normal sensor behavior by leveraging sensor spatial and temporal correlations, identifying erroneous sensor readings as faults. Finally, learning-based methods are inferred a model for the faulty sensor readings using training data and statistically detect and identify classes of faults. We illustrate the performance of a hybrid approach on data coming from two actual sensor deployments. © 2012 IEEE.
【抄録日本語訳】ワイヤレスセンサネットワーク（WSN）の導入経験では、バッテリーの消耗、環境干渉、センサーの老朽化など、内部および外部の影響により、収集したデータが不正確になりがちであることが示されています。このような不具合を早期に発見することが、センサネットワークの効果的な運用に必要です。我々は、データ障害の種類とその原因を特定することに焦点を当てる。特に、我々は、3つの質的に異なるクラスの故障検出法に基づく故障検出のためのハイブリッドアプローチを提案する。ルールベースの方法は、故障を識別し分類するためのヒューリスティックなルールを開発するために、ドメインと専門家の知識を活用する。推定法は、センサーの空間的・時間的相関を利用して正常なセンサー動作を予測し、誤ったセンサー読み取りを故障として識別する。最後に、学習ベースの手法は、学習データを用いて欠陥のあるセンサーの読み取りのモデルを推論し、統計的に欠陥のクラスを検出し、識別するものである。我々は、2つの実際のセンサー配置から得られたデータで、ハイブリッドアプローチの性能を説明する。© 2012 IEEE.