740 IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, VOL. 16, NO. 6, JUNE 2004 On Computing Mobile Agent Routes for Data Fusion in Distributed Sensor Networks
Qishi Wu, Member, IEEE, Nageswara S.V. Rao, Senior Member, IEEE, Jacob Barhen, Member, IEEE, S. Sitharama Iyengar, Fellow, IEEE, Vijay K. Vaishnavi, Fellow, IEEE, Hairong Qi, Member, IEEE, and Krishnendu Chakrabarty, Senior Member, IEEE
Abstract—The problem of computing a route for a mobile agent that incrementally fuses the data as it visits the nodes in a distributed sensor network is considered. The order of nodes visited along the route has a significant impact on the quality and cost of fused data, which, in turn, impacts the main objective of the sensor network, such as target classification or tracking. We present a simplified analytical model for a distributed sensor network and formulate the route computation problem in terms of maximizing an objective function, which is directly proportional to the received signal strength and inversely proportional to the path loss and energy consumption. We show this problem to be NP-complete and propose a genetic algorithm to compute an approximate solution by suitably employing a two-level encoding scheme and genetic operators tailored to the objective function. We present simulation results for networks with different node sizes and sensor distributions, which demonstrate the superior performance of our algorithm over two existing heuristics, namely, local closest first and global closest first methods.
Index Terms—Genetic algorithms, mobile agents, distributed sensor networks. æ
1INTRODUCTION
ULTIPLE sensor systems have been the target of active amount of data of various modalities, which makes it Mresearch since the early 1990s [1] with a particular critical to collect only the most important data and to collect emphasis on the information fusion methods for distributed it efficiently. In addition, despite the abundance of sensor networks (DSNs) [2], [3], [4]. Recent developments in deployed sensors, not all sensor data is critical to ensuring the sensor, networking, and computing areas now make it the quality of fused information such as adequate detection possible to deploy a large number of inexpensive and small energy for target detection or tracking. sensors to “achieve quality through quantity” in complex In conventional fusion architectures, all the sensor data is applications. In an important subclass of DSNs that are sent to a central location where it is fused. But, the deployed for remote operations in large unstructured transmission of noncritical sensor data in military DSN geographical areas, wireless networks with low bandwidth deployments increases the risk of being detected in addition are usually the only means of communication among the to consuming the scarce resources such as battery power sensors. These sensors are typically lightweight with and network bandwidth. To meet these new challenges, the limited processing power, battery capacity, and commu- concept of mobile agent-based distributed sensor networks nication bandwidth. The communication tasks consume the (MADSNs) has been proposed by Qi et al. [5] wherein a limited power available at such sensor nodes and, therefore, mobile agent selectively visits the sensors and incrementally in order to ensure their sustained operations, the power fuses the appropriate measurement data. Mobile agents are consumption must be kept to a minimum. Furthermore, the special programs that can be dispatched from a source node massively deployed sensors typically generate a huge to be executed at remote nodes. Upon arrival at a remote node, a mobile agent presents its credentials, obtains access to local services and data to collect needed information or perform certain actions, and then departs with the results. . Q. Wu, N.S.V. Rao, and J. Barhen are with the Center for Engineering Science Advanced Research, Computer Science and Mathematics Division, One of the most important aspects of mobile agents is the Oak Ridge National Laboratory, One Bethel Valley Road, PO Box 2008, security, which is not addressed here, but is being actively MS-6016, Oak Ridge, TN 37831-6016. investigated [6], [7]. The transfer of partial fusion results by E-mail: {wuqn, raons, barhen}@ornl.gov. . S.S. Iyengar is with the Department of Computer Science, Louisiana State a mobile agent may still have the risk of being spied on with University, Baton Rouge, LA 70803. Email: [email protected]. hostile intent; the serial data collection process employed by . V.K. Vaishnavi is with the Department of Computer Information Systems, the mobile agent obviously decreases the chance of Georgia State University, PO Box 4015, Atlanta, GA 30302-4015. exposing the individual raw data. Although there are E-mail: [email protected]. . H. Qi is with the Electrical and Computer Engineering Department, advantages and disadvantages of using mobile agents [8] University of Tennessee, Knoxville, TN 37996. E-mail: [email protected]. in a particular scenario, their successful applications range . K. Chakrabarty is with the Department of Electrical and Computer from e-commerce [9] to military situation awareness [10]. Engineering, Duke University, Box 90291, 130 Hudson Hall, Durham, NC 27708. E-mail: [email protected]. They are found to be particularly useful for data fusion tasks in DSN. The motivations for using mobile agents in Manuscript received 11 June 2003; revised 4 Dec. 2003; accepted 16 Jan. 2004. For information on obtaining reprints of this article, please send e-mail to: DSN have been extensively studied [5]. For a particular [email protected], and reference IEEECS Log Number TKDE-0092-0603. multiresolution data integration application, it is shown
1041-4347/04/$20.00 ß 2004 IEEE Published by the IEEE Computer Society WU ET AL.: ON COMPUTING MOBILE AGENT ROUTES FOR DATA FUSION IN DISTRIBUTED SENSOR NETWORKS 741 that a mobile-agent implementation saves up to 90 percent power needed for communication and the path losses. In of the data transfer time due to savings in avoiding the raw particular, the objective function is directly proportional data transfers. Also, the conditions under which an to the signal energy and inversely proportional to the MADSN performs better than a DSN are analyzed and path loss and cost of communication. We show MARP to the conditions for an optimum performance of the former be NP-complete by using a reduction from a variation of are determined in [5]. In this paper, we direct our research the 3D traveling salesman problem. We then propose an efforts to the networking aspects of MADSNs with a focus approximate solution based on a genetic algorithm (GA) on a new routing method for mobile agents. that employs a two-level encoding scheme and genetic The order and number of nodes on the route traversed operators tailored to the objective function, which aims to by a mobile agent determine the energy consumption, path achieve the total signal energy equal to or above a given loss, and detection accuracy and, hence, have a significant level. Simulation results for networks with different node impact on the overall performance of a MADSN. Low sizes and sensor distributions show that our algorithm energy consumption, low path loss, and high detection has superior performances compared to LCF and GCF. accuracy are always the main goals of a DSN. The The rest of the paper is organized as follows: In Section 2, computation of a suitable route, in practice, involves a models for sensor nodes and wireless communication links trade off between the cost (energy consumption and path are described and then the mobile agent routing problem is loss) and the benefit (detection accuracy): Visiting more formulated. The details of the genetic algorithm are sensors improves the quality of fused data, but also raises discussed in Section 3, including a two-level encoding the communication and computing overheads. Previously, scheme, derivation of the objective function, and imple- algorithms based on the local closest first (LCF) and global mentations of genetic operators. Simulation results are closest first (GCF) heuristics have been used to compute presented and discussed in Section 4. Concluding remarks routes for mobile agents in MADSNs [5]. Their performance are provided in Section 5. has been quite satisfactory for small DSNs that are system- atically deployed in simple environments. However, their 2MOBILE AGENT ROUTING PROBLEM performance deteriorates as the network size grows and the We now briefly describe the architecture of an MADSN to sensor distributions become more complicated. In some motivate the formulation of the route optimization problem. practical DSNs deployed for target detection and tracking, An MADSN typically consists of three types of components: these two heuristics could generate particularly unsatisfac- processing elements (PE), sensor nodes, and communication tory solutions since they only consider the spatial distances network [11]. The processing elements and sensors are between sensor nodes. In these applications, several other usually interconnected via a wireless communication net- factors, such as sensor detection signal levels and link work. A group of neighboring sensor nodes that are power consumption, play a very significant role and, commanded by a single PE forms a cluster. indeed, could be far more important than the physical distance alone. It is very critical to consider all these factors 2.1 Sensor Nodes simultaneously in the data fusion process for computing a A sensor node, also referred to as a leaf node, is the basic satisfactory route of a mobile agent. functional unit for data collection in an MADSN. A sensor We consider DSNs with geographically distributed node may have several channels with different sensory sensors that are deployed for the purposes of target devices connected to each of them. Sensor nodes are classification and tracking, both of which require the geographically distributed and collect measurements of fusion of measurements from a number of sensors. The different modalities such as acoustic, seismic, and infrared measurements from the sensors that receive strong signals from the environment. The data acquisition is controlled by from the target are often most useful in the fusion process a sampling subsystem, which provides the acquired data to and, thus, the fusion step is preceded by identifying a the main system processor [12]. The signal energy from subset of sensor nodes that “strongly” indicate the each channel can be detected individually and processed in presence of a target. By identifying such sensor nodes, the analog front end. A mobile agent migrates among the the complexity and amount of sensor data for fusion sensor nodes via the network, integrates local data with a operation can be significantly reduced. Thus, we focus on desired resolution sequentially, and carries the final result identifying a route for a mobile agent through such to the originating PE. The fused data may be used to derive sensor nodes by utilizing the signal strengths of the appropriate inferences about the environment based on the sensor nodes. The mobile agent then visits these nodes application. The setup time of a PE accounts for loading the and performs the required fusion of the sensor informa- mobile agent code and performing other initialization tasks. tion available at these nodes. Each sensor can transmit The amount of signal energy that reaches an individual messages with certain energy costs, and the transmissions sensor is an effective indicator of how close the node is to a are subject to wireless propagation losses. We formulate potential target. Once the signal is captured and processed the mobile agent routing problem (MARP) in an MADSN as by a sensor node, the strength level of the detected signal is a combinatorial optimization problem involving the cost broadcast (through an omnidirectional antenna) to the of communication, path loss due to wireless propagation, whole cluster so that a PE in any location has a knowledge and signal energy received by the sensors. The overall of various levels of signal energy detected by all active routing objective is to maximize the sum of the signal sensor nodes within the coverage area. In target detection energy received at the visited nodes while minimizing the applications, a leaf node with higher signal energy carries 742 IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, VOL. 16, NO. 6, JUNE 2004
Fig. 1. A simple MADSN with one PE and 10 leaf nodes. more information and should have higher priority of being An MADSN with a simple configuration is shown in visited. To simplify computation, we use a quantitative Fig. 1 for illustrative purposes. The sensor network value to represent the level of signal energy detected by a contains one PE, labeled as S0, and N ¼ 10 leaf nodes, local sensor node. labeled Si;i¼ 1; 2; ...;N, one of which is down. The sensor nodes are spatially distributed in a surveillance 2.2 Communication Links region of interest, each of which is responsible for Wireless communication links need to be established collecting measurements in the environment. The signal between neighboring nodes as the mobile agent migrates energy detected by sensor node Si is denoted by along a route. The embedded RF modems on sensor nodes si;i¼ 1; 2; ...;N. Sensor node Si takes time ti;acq for data provide such a networking capability with low power acquisition and time ti;proc for data processing. The requirement. For example, on WINS NG 2.0 platform [12], wireless communication link with physical distance di;j each node is equipped with two RF modems, both of which between sensor node Si and Sj has channel width W bits support 2.4 GHz frequency-hopped spread spectrum (FHSS) and operates at frequency B Hz. Some sensor nodes may communication. The different clusters select different “net- be down temporarily due to intermittent failures, as sensor work numbers,” which correspond to separate hopping S9 in Fig 1. pseudonoise sequences to avoid interferences. We define an The routing objective is to find a path for a mobile agent abstract link class with only the parameters we are that satisfies the desired detection accuracy while minimiz- interested in. The detailed radio configuration and wireless ing the energy consumption and path loss. The required link establishment is beyond the scope of the paper. path is computed based on the current state of DSN and the It is worthwhile to note that the message transmission mobile agent traverses the nodes along the path while time between two sensor nodes depends not only on the performing the fusion operation. The energy consumption physical distance between them, but also on the channel depends on the processor operational power and computa- bandwidth and the data packet loss rate as well as the size tion time and the path loss is directly related to the physical of the message, which includes partially integrated data and length of the selected path. We define these quantities in the mobile agent code itself. In general, the electromagnetic next section. propagation time is almost negligible in short-range 2.4 Objective Function wireless communication. Hence, the physical distance is not explicitly considered in our model, but is incorporated The objective function for the mobile agent routing problem as a part of the path loss representing the signal attenuation. is based on three aspects of a routing path: energy The received signal strength below a certain level due to consumption, path loss, and detected signal energy: path loss may not be acceptable. The system loss factor is a 1. Energy consumption. The energy consumption at parameter of the free space propagation model, which is not a sensor node is determined by the processing necessarily related to the physical propagation [13]. speed and the computation time. If an energy- driven Real-Time Operating System (RTOS) is 2.3 Mobile Agent Routing installed on the sensor node, the processor speed A mobile agent is dispatched from a processing element and can be dynamically scaled depending on workload is expected to visit a subset of sensors within the cluster to and task deadlines [15]. For wireless message fuse data collected in the coverage area. Generally, the more transmissions, the energy consumption depends sensors visited, the higher the accuracy achieved using any on the sensor’s transmission power and message reasonable data fusion algorithm will be [14]. But, it is transmission time. We assume that the message important to select an appropriate route so that the required includes the mobile agent code of size M bits and signal energy level can be achieved with a low cost in terms measured data of size D bits. For a given desired of total energy consumption and path loss. resolution, a fixed data size D is used to store the WU ET AL.: ON COMPUTING MOBILE AGENT ROUTES FOR DATA FUSION IN DISTRIBUTED SENSOR NETWORKS 743
partially integrated data at each sensor. The time decision in target detection. The sum of the detected for the message to be transmitted over a wireless signal energy SE along path P is defined as: channel of bandwidth BW bps is calculated as: HX 1 M þ D SEðPÞ¼ sP½k ; t ¼ : msg BW k¼1 where s is the signal energy detected by the The energy consumption EC of path P, consisting of P½k kth sensor node on path P. nodes P½0 ;P½1 ; ...;P½H 1 , is defined as: By combining the above three aspects of a routing path, 2 ECðPÞ¼a ðt0;setup þ t0;procÞ F0 þ P0;t tmsg we consider an objective function as follows: HX 1 2 1 1 þ b ðtP½k :acq þ tP½k ;procÞ F þ PP½k ;t OðPÞ¼SEðPÞ þ ; P½k ECðPÞ PLðPÞ k¼1 wherein the terms SEðPÞ, ECðPÞ, and PLðPÞ are assumed tmsg ; to be “normalized” to appropriately reflect the contribution by various loss terms. Intuitively, this objective function where the kth leaf node SP½k on path P has data prefers paths with higher signal energies by penalizing acquisition time tP½k ;acq, data processing time tP½k ;proc, those with high path losses and energy consumption. For operational level FP½k , transmitting power PP½k ;t; and node P½0 ¼0 corresponds to the PE. Here, the the detection applications, we are interested in concluding operational level refers to the processor operating the presence of a target in the monitoring area, which is frequency, the square of which determines its determined by a threshold level of detected energy. For corresponding operating power level. Coefficients a example, this threshold can be determined to maximize the and b are suitably chosen to “normalize” the probability of detection while keeping the false alarm rate operational level to its power level. below a specified quantity. In particular, a path providing high signal energy at the expense of a considerable amount 2. Path loss. The power received by sensor Sj has the following relation with the power transmitted by of energy consumption and path loss may not be preferable. sensor Si according to the Friis free space propaga- Note that alternative objective functions may be used as tion model [13]: long as they correctly reflect the trade off between detected signal energy, energy consumption, and path loss. 2 Gi;tGj;r To facilitate the design of the genetic algorithm in P ðd Þ¼P ; j;r i;j i;t 2 2 ð4 Þ di;j Section 3, we define a fitness function based on the above objective function as follows: where Gi;t is the gain of sensor Si as a transmitter and Gj;r is the gain of sensor Sj as a receiver. fðPÞ¼OðPÞþg; c Wavelength is the ratio of speed of light and where g is the penalty function for overrunning the f carrier frequency and is the system loss factor. constraint and is defined by: The physical distance di;j between Si and Sj is computed from their spatial locations. Path loss (PL) 0;SEðPÞ E g ¼ represents the signal attenuation as a positive ðSEðPÞ EÞ=E SEðPÞ