2013-06655562

1

Overall Cost Minimization for Data Aggregation inEnergy-Constrained Wireless Sensor Networks

Wei An†, Song Ci †‡∗, Haiyan Luo�, Dalei Wu�, Yanni Han†, Ying Qi†, and Tao Lin†

†High Performance Network Lab, IOA, Chinese Academy of Sciences, Beijing 100190, China‡Department of CEEN, University of Nebraska–Lincoln, NE 68182, USA

�Cisco Systems, Inc., USA�Massachusetts Institute of Technology, Cambridge, MA 02139, USA

Abstract—In wireless sensor networks (WSNs), sensor nodesare usually powered by batteries of limited capacity, which resultsin dynamic changes of available paths for data aggregation dueto node failures caused by energy depletion. For transmittingcertain amount of data generated by source node, the overalltransmission cost is affected by two major factors, using sequenceof available paths and amount of data imposed on each path,which becomes a major issue significantly influencing the efficientusage of the networks. To address this issue, we consider theoptimization problem of how to minimize the overall transmissioncost of given data delivered from the source node to the sink nodein the energy-constrained WSN. Specifically, we first describethe problem on the basis of the minimum cost flow theory andderive the upper bound for the data amount in terms of thenumber of packets that can be successfully transmitted fromthe source node to the sink node. Then, we propose specificalgorithms to derive the optimal paths and their optimal dataamounts, and then achieve the minimized overall transmissioncost for the certain amount of data. Extensive simulations showthat significant performance enhancement can be achieved byusing our proposed algorithms.

I. INTRODUCTION

Due to the rapid development in electronics and communi-

cations, wireless sensor networks (WSNs) have made plenty of

advances for providing more reliable information with higher

Quality of Service (QoS) at lower costs [1]. Sensor nodes

are usually deployed in an area of interest for monitoring

environments or events, and multi-hop paths are adopted for

transmitting the data from the source nodes to the sink node.

However, sensor nodes are usually powered by limited energy

supply, like battery, which cannot be recharged or replaced in

harsh environments, such as battle fields, nuclear, biological

and chemical contaminated areas. Due to the failures of

sensor nodes caused by energy depletion, available paths are

suffered from dynamical changes, which further affects the

transmission costs of data that are delivered to the sink node.

Consider the topology of a wireless sensor network as

shown in Figure 1. The labels on the links between any two

network nodes are the transmission cost associated with them.

* Corresponding author. Email address: [email protected] work was partially supported by A3 Foresight under Grant No.

11161140319 and by National Science Foundation of China under grantnumbers No. 61201232, by Academy-Locality Cooperation Project of Chinaunder grant No. YDJDBNJ-2012-005 and by US National Science Foundationunder grant No.1145596 and No. 0830493.

(a) Scheme 1 (b) Scheme 2

Fig. 1. Illustration of the topology of a wireless sensor network, where nodes and t are the source and sink nodes, respectively. Other nodes are the relaynodes.

The numbers shown in the rectangles are the available energy

on the corresponding node, represented by how many packets

can be sent before the energy is fully consumed. Here, we

assume the packet size is fixed. As shown in Figure 1(a), we

exploit a greedy strategy to relay given 900 packets, which

may represent the data generated by source node s over a

long period of time, from s to t. We first find the minimum

cost path s→ 4→ 3→ 2→ t by using the traditional routing

algorithms, such as Dijkstra’s algorithm [2]. Suppose nodes sand t have huge energy supply. We impose 500 packets on this

path, and then nodes 2, 3 and 4 will run out of energy. Next,

we find another minimum cost path s→ 6→ 7→ t and then

relay 400 packets with this path. The overall transmission cost

can be calculated as 500×13+400×18 = 13700. However, by

the combined usage of two sub-optimal paths of transmission

cost s→ 1→ 2→ t and s→ 4→ 5→ t in Figure 1(b), 900

packets can be achieved with the overall transmission cost

of 500 × 14 + 400 × 15 = 13000. Obviously, the former

scheme of always using the minimum cost paths cannot reach

the minimum overall transmission cost, because the former

selected minimum cost paths affect the selection of the latter

paths. With this instance, two issues need to be addressed: (1)

since the brute-force search of enumerating all combinations

of available paths is time-consuming, efficient algorithms are

needed to derive the optimal combination of paths; (2) proper

amount of data for each path needs to be determined so as

to minimize the minimum overall cost of given data. In sum,

how to effectively minimize the overall transmission cost of

given data is the issue that will be addressed in this work.

Increasing efforts can be found in literature that focus on the

978-1-4673-3122-7/13/$31.00 ©2013 IEEE

IEEE ICC 2013 - Wireless Networking Symposium

6014

2

enhancement of data transmission cost. Ye et. al [2] exploited

the minimum-cost path for delivering packets from any sensor

to an interested client user in a large sensor network based

on the cost field. They presented a backoff-based cost field

setup algorithm that finds the optimal costs of all nodes to the

sink and then the message, carrying dynamic cost information,

and flows along the minimum cost path in the cost field. In

contrast to the optimizing protocols that burn the energy of

the nodes along the optimal paths, Shah et. al [3] proposed

a new scheme which uses sub-optimal paths of transmission

cost (i.e., transmission energy consumption) occasionally to

provide substantial gains on reducing the transmission cost

and extending the network lifetime. Compared with the above

work which considers only one parameter, Dai et. al [4]

investigated the problem of the bounded-delay minimum-cost

path, which considers both message delay and transmission

cost, and designed an improved heuristic algorithm for solving

the problem. With combination of overall energy consumption

and the network lifetime, Tu et. al [5] presented a compre-

hensive cost function, which takes into account other costs

such as queueing delay and congestion, formulated and solved

the problem of minimizing the overall cost with a lifetime

guarantee. An et. al [6] maximized the network lifetime and

derived the optimal paths for data transmission. However, for

transmitting given data, the existing schemes degrade to the

suboptimal solution of the overall transmission cost because

those schemes were developed with different purposes instead

of minimizing the overall transmission cost of given data.

As aforementioned, the existing work having been done

on transmission cost cannot achieve the optimal transmission

cost for delivering given data from the source node to the

sink node. Thus, we propose a general and low-complexity

scheme based on minimum cost flow theory in this work. With

the proposed scheme, we mathematically calculate the proper

number of packets that going through each sensor nodes. Then

we propose the algorithms to derive the optimal paths and

the optimal data amount for each path. We also analyze the

correctness of the proposed algorithms. Extensive simulation

results are shown to verify the performance enhancement of

the proposed scheme.

The remainder of this paper is organized as follows. In

Section II, we describe the system model and mathematically

formulates the problem. We design efficient algorithms to

achieve the optimal results in Section III. Simulation results

are given in Section IV. Finally, we conclude the paper in

Section V.

II. PROBLEM DESCRIPTION

A. Energy Dissipation Model

Due to sensor node failure caused by energy depletion,

available paths between the source node and the sink node

becomes increasingly scarce over the whole lifetime of the

network. Here, we introduce the First Order Radio (FOR)

model [6]–[8] to calculate the energy consumption of sensor

nodes. In this model, the consumed energy of the transmitter

to deliver a B-bit packet over distance D can be calculated as

Etx = (etx + ed · Dn) · B, (1)

where etx is the energy/bit consumed by the transmitter elec-

tronics, and ed represents the energy/bit/distancen dissipated

in the transmit op-amp. n is the power index for the channel

path loss of the antenna, which depends on the radio frequency

environment. On the receiving side, the amount of energy

that is required to capture the incoming radio signal can be

calculated as

Erx = erx · B, (2)

where erx is the energy/bit consumed by the receiver electron-

ics.

Further, a WSN can be modeled as an undirected graph

G(V,E), where V and E are the node set and the link set,

respectively. A link (u, v) exists if, and only if, nodes u and

v can communicate with each other directly. In this work, the

truncated automatic repeat request (ARQ) protocol, which has

also been used in [9], [10], is adopted to guarantee the quality

of data transmission. Without losing generality, we assume that

the average number of retransmission times between node uand its neighboring nodes is Ku, and all data packets are of

the same size of B bits. Suppose node u relays a B-bit packet

from node v1 to node v2, then the energy consumption of node

u can be calculated according to Eqs.(1) and (2) as

E0 =(Etx + Erx) · Ku=(etx + ed · Dn + erx) · B · Ku. (3)

Let Eu represent the available energy of node u, then the

number of packets that can be delivered by node u can be

predicted by

U(u) =⌊EuE0

⌋, (4)

where � � denotes the flooring operation. We also assume

that the source and sink nodes are equipped with much more

energy compared to relay nodes. This is reasonable from the

cost-effective perspective.

B. Minimum Overall Transmission Cost Problem

Available minimum cost paths between the source node

s and the sink node t determine the overall transmission

cost of the given data that are generated by node s over

a long period of time. Considering that the given data may

be transmitted periodically, here we study the typical case in

which the transmission costs associated with the links are time-

invariant. For instance, distance and number of hops are two

typical time-invariant costs. Due to limited energy supply, the

maximum number of packets going through a given path is

restricted by the node with the minimum available energy. Let

{π1, π2, · · · , πk} be all the paths between s and t, where πiis a path in G(V,E) (1 ≤ i ≤ k). Then, the maximum number

of packets going through path πi can be computed as

U(πi) = minu∈πi

{U(u)} . (5)

In graph G(V,E), two nodes u and v are called connected if

G(V,E) has at least one path between them. Otherwise, they

are called disconnected.

6015

3

Defintion 1: A s–t node-cut Vcut is a set of nodes except sand t satisfying that, if these nodes are removed, nodes s and

t are disconnected [11]. The weight of a s–t node-cut Vcut is

defined as

C(Vcut) =∑u∈Vcut

U(u). (6)

Thus, the minimum weight of the s–t node-cuts can be denoted

as Cmin, which can be derived by using the Ford-Fulkerson

algorithm [11]. According to max-flow min-cut theorem [6],

[11], [12], this weight serves as the upper bound for the

number of packets that can be delivered from the source node sto the sink node t. In this work, we focus on a general problem

of transmitting given C0 packets from the source node s to the

sink node t with 0 < C0 ≤ Cmin. Specifically, we choose the

optimal path for each packet under the constraint of delivering

all the C0 packets from the source node to the sink node. Define

c(u, v) the cost of link (u, v) of each packet, then the path cost

of πi can be calculated as

c(πi) =∑

(u,v)∈πi

c(u, v). (7)

For notation convenience, denote the number of packets

going through πi by h(πi), where h(πi) ≤ U(πi). Thus,

the minimum overall transmission cost (MOTC) problem is

to determine h(πi) such that the overall transmission cost

of delivering the C0 packets is minimized, which can be

formulated mathematically as

mink∑i=1

h(πi) · c(πi) (8)

s.t.:

⎧⎪⎪⎨⎪⎪⎩

k∑i=1

h(πi) · xiv ≤ U(v) for each v ∈ Vk∑i=1

h(πi) = C0where xiv = 1 if node v is on πi; otherwise, xiv = 0 (1 ≤i ≤ k). In this problem, h(πi) = 0 implies that path πi is not

selected for data transmission.

III. OPTIMAL SOLUTION

To facilitate the application of the flow theory, we first

introduce the network conversion, which is firstly described

in our previous work [6] and describes the procedure of

converting an undirected graph into a directed one. The details

are shown in Algorithm 1, whereM is a large positive number

larger than maxu∈V{U(u)}. In the following sections, for given

C0 packets, we develop algorithms for deriving the optimal

paths and the optimal packet amount for each arc in G(N,A),and further obtain the optimal data transmission scheme.

A. Determining Data Amount for Each Arc

Initialization: For each 〈u, v〉 ∈ A, let g(u, v) represent

the current maximum number of packets that can flow from

node u to node v. First, we initialize g(u, v) = U(u, v).According to Algorithm 1, arcs 〈u, v〉 and 〈v, u〉 cannot appear

simultaneously in G(N,A). For conveniences, we can add a

Algorithm 1:Require: G(V,E), U(u) for each u ∈ V and c(u, v) for

each (u, v) ∈ E1: Create graph G(N = Ø, A = Ø)2: for each u ∈ V do3: N = N ∪ {uin, uout};4: A = A ∪ {〈uin, uout〉};5: c〈uin, uout〉 = 0;6: U(uin, uout) = U(u);7: end for8: for each (u, v) ∈ E do9: A = A ∪ {〈uout, vin〉, 〈vout, uin〉}.

10: c〈uout, vin〉 = c〈vout, uin〉 = c(u, v).11: U(uout, vin) = U(vout, uin) =M.

12: end for

Fig. 2. The algorithm for converting undirected network G(V,E) to directedone G(N,A).

new arc 〈v, u〉 in G(N,A) for each 〈u, v〉 ∈ A and initialize

g(v, u) = 0.The main idea of our algorithm is to find the minimum cost

path πi using the Dijkstra’s Shortest Path (DSP) algorithm

each time. And update g(u, v) and g(v, u) constantly by

g(u, v) = g(u, v)−Δ and g(v, u) = g(v, u)+Δ, respectively,

where Δ packets are delivered from u to v and the value of

Δ is determined by the minimum cost path πi. During the

implementation of the DSP algorithm, we only consider the

arcs 〈u, v〉 with g(u, v) > 0. The given data amount, i.e., C0,

of G(N,A) is achieved when Δ ≥ C0. As for Problem Eq.(8),

the first condition is satisfied because g(u, v) is always smaller

than U(u, v), and the second condition is met when C0 = 0.The details of this algorithm are described in Figure 3.

Algorithm 2:Require: C0, G(N,A)

1: for each arc 〈u, v〉 in A do2: g(u, v)← U(u, v)3: g(v, u)← 04: end for5: while C0 > 0 do6: Find the minimum cost path πi from s to t in

G(N,A) using the DSP algorithm.

7: Δ← min{min

〈u,v〉∈πi

{g(u, v)} , C0}

8: for each arc 〈u, v〉 on πi do9: g(u, v)← g(u, v)−Δ

10: g(v, u)← g(v, u) + Δ11: end for12: C0 = C0 −Δ13: end while

Fig. 3. The algorithm for deriving packet amounts of arcs.

Computational complexity: Since C0 is the given data

amount and the DSP algorithm runs O(|N |2) time to find

the minimum cost path πi and Δ is at least 1, the runtime of

Algorithm 2 is upper bounded by O(C0|N |2), where |N | is

6016

4

the number of nodes in N .

B. Deriving the Optimal Data Transmission Scheme

After running Algorithm 2, we derive a value of packet

amount for each arc in G(N,A). Based on the number

of packets on each arc, an algorithm is developed to find

the optimal paths in G(V,E), together with determining the

optimal packet amounts for these paths. For 〈u, v〉 ∈ A,

r(u, v) = U(u, v) − g(u, v) is the number of packets that

have gone through arc 〈u, v〉. For each (u, v) ∈ E, we have

two corresponding arcs 〈uout, vin〉 and 〈vout, uin〉 in A. Thus,

the number of packets that have gone through (u, v) ∈ E can

be calculated as

h(u, v) = |r(uout, vin)− r(vout, uin)|. (9)

Now we can add a new weight of h(u, v) on each link (u, v) ∈E. Based on that, we propose Algorithm 3, shown in Figure 4,

for deriving the optimal data transmission scheme.

Algorithm 3:Require: C0, G(V,E) and G(N,A), g(u, v) for each

〈u, v〉 ∈ A1: T := Ø2: for each (u, v) in E do3: g(u, v)← |r(uout, vin)− r(vout, uin)|;4: end for5: Delete all links (u, v) of h(u, v) = 0 from G(V,E);6: while C0 > 0 do7: Find the optimal path πi from s to t in G(V,E) with

the DSP algorithm.

8: Δ← min(u,v)∈πi

{h(u, v)}9: for each (u, v) on πi do

10: h(u, v)← h(u, v)−Δ11: end for12: C0 = C0 −Δ;

13: Delete all links (u, v) with h(u, v) = 0 on path πi;14: Put the 2-tuple (πi,Δ) into T ;

15: end while

Fig. 4. The algorithm for deriving the optimal data transmission scheme.

For computational complexity, adding weight h(u, v) to

the graph G(V,E) takes O(|E|) time. Further, deriving the

minimum cost path using the DSP algorithm takes O(|V |2)time. Since at least one link is deleted after one minimum cost

path is found, the DSP algorithm runs at most |E| times. Thus,

the computational complexity of searching for the optimal data

transmission scheme is O(|E|) +O(|V |2|E|) = O(|V |2|E|).For the theoretical performance of Algorithms 2 and 3, the

following proposition is given:

Proposition 1: Suppose a set of 2-tuples {(πi, h(πi))}li=1

achieve the minimum transmission cost of C with C =l∑

i=1

c(πi) ·h(πi) < C0. Let Δ = 1 be one packet going through

the minimum cost s–t path πl+1. Then {πi}l+1i=1 achieves the

minimal cost of C′ = C + Δ with C′ =l∑

i=1

c(πi) · h(πi) +c(πl+1).

Due to the limited space, we omit the proof and refer the

reader to [11]. This proposition guarantees that our proposed

algorithm can find the optimal solution of overall transmission

cost for given data.

IV. PERFORMANCE EVALUATION

During the simulations, we randomly distribute sensor nodes

in a 100×100m2 area, where the coordinates of the source

node s and the sink node t are (0, 50) and (100, 50), respec-

tively. The initial energy of each node is set equal to 0.5J.

The energy consumption for the transmitting is set equal to

50nJ/bit, transmitting amplifier 100pJ/bit/m2 and the receiving

circuitry 50nJ/bit. Source node s generates the fixed 2000-bit

packets of the collected data and transits them to node t via

the chosen multi-hop paths. In this work, we generalize the

concept of link cost, which can be anything of time-invariance.

To achieve this, random numbers in the range of [0, 1] are gen-

erated to represent the costs of links. We compare our proposed

scheme (Proposed) with the routing algorithms of minimum

cost forwarding (MCF) [2], energy aware routing (EAR) by

replacing transmission energy by transmission cost [3], and

minimum cost problem with a network lifetime guarantee

(MCPLG) [5]. In the following, we illustrate the performance

of our proposed scheme with two sets of experiments: the

first set is devised for evaluating the overall transmission

cost achievements by compared with above three schemes;

the second set is to illustrate our proposed scheme can also

increase the number of packets transmitted successfully from

the source node s to the sink node t.As for overall transmission cost, we run the first set of

experiments on the network, in which 100 relay nodes are

randomly deployed in the 100×100m2 area, with C0 = 1000,5000 and 9000 packets under difference radii. We allow the

communication radii to change from 30m to 70m for each

set of simulations, at a 10m step size. In order to reflect the

changes of the overall transmission costs under differen given

data, we define the average cost per packet, cavg(C0), which

can be calculated as

cavg(C0) = Q(C0)C0 , (10)

where Q(C0) is the overall transmission cost of transmitting

C0 packets. It is observed in Figure 5 that our proposed

scheme outperforms the compared schemes significantly in

term of the average cost per packet and thus the overall

transmission cost for all C0 = 1000, 5000 and 9000. As C0increases, more and more performance gains are achieved.

Compared with the MCF, EAR and MCPLG, our proposed

scheme achieves only 0.67%, 47.51% and 1.68% gains by

average at C0 = 1000, but makes around 4.81%, 25.06% and

8.49% gains by average at C0 = 5000 and significant 16.42%,

16.90% and 20.32% gains by average at C0 = 9000. This

implies that our proposed scheme works especially well for

allocating the data for available paths over the whole lifetime

of the network.

The second set of experiments is devised for evaluating

the lifetime improvement of our proposed scheme on the net-

works, where 60, 70, 80, 90 and 100 relay nodes are randomly

6017

5

30 40 50 60 700

0.1

0.2

0.3

0.4

0.5

0.6

0.7

0.8

Communication Radius: m

Ave

rage

Cos

t Per

Pac

ket:

c avg

ProposedMCFEARMCPLG

(a) C0 = 1000

30 40 50 60 700.2

0.3

0.4

0.5

0.6

0.7

0.8

0.9

1


Ave

rage

Cos

t Per

Pac

ket:

c avg

ProposedMCFEARMCPLG

(b) C0 = 5000

30 35 40 45 50 55 60 65 700.4

0.5

0.6

0.7

0.8

0.9

1

1.1


Ave

rage

Cos

t Per

Pac

ket:

c avg

ProposedMCFEARMCPLG

(c) C0 = 9000

Fig. 5. Comparison of the average transmission cost per packet of our proposed scheme vs. the MCF, EAR and MCPLG in the wireless sensor networkwith different node transmission radii.

60 65 70 75 80 85 90 95 1000.4

0.6

0.8

1

1.2

1.4

1.6

1.8x 104

Node Number

Pac

ket N

umbe

r

ProposedMCFEARMCPLG

Fig. 6. Comparison of the maximum number of packets that are successfullytransmitted using our proposed scheme on the wireless sensor network ofdifferent number of relay nodes.

deployed on the 100×100m2 area, respectively. We keep run-

ning the system until it reaches its lifetime, i.e. no more path

is available between s and t due to depleted energy. As shown

in Figure 6, our proposed scheme significantly increases the

number of packets, that are transmitted successfully between

the source node s and t, compared to the MCF, EAR and

MCPLG. As for the number of successfully relayed packets,

our proposed scheme can increase up to 50.98%, 30.92% and

53.51% of the number of packets by average, respectively,

compared to the MCF, EAR and MCPLG. This performance

improvement is achieved by the back-flow strategy because it

can allocate proper amount of data for each path and thereby

maximize the number of successfully relayed packets. This

illustrates that our proposed scheme cannot only reduce the

overall transmission cost of the given data generated by the

source node over a long period of time, but also can efficiently

improve the network capability of relaying much more data.

V. CONCLUSIONS

Overall transmission cost minimization is one of the major

challenges for the applications of wireless sensor networks.

In this work, we have considered an optimization problem

of minimizing the overall transmission cost of given data in

energy-limited wireless sensor networks. We have formulated

the problem on the basis of the minimum cost flow theory and

have also discussed the upper bound for the number of packets

that can be successfully relayed from the source node to the

sink node. Then, our algorithms have been proposed to derive

the optimal paths and the optimal data amount for each path.

Extensive simulation results have proved the effectiveness of

the proposed method.

REFERENCES

[1] A. Rao, D. Izadi, R. Tellis, S. Ekanayake, and P. Pathirana, “Datamonitoring sensor network for BigNet research testbed,” in Proc. ofISSNIP, Melbourne, Australia, Dec. 2009, pp. 169–173.

[2] F. Ye, A. Chen, S. Liu, and L. Zhang, “A scalable solution to mini-mum cost forwarding in large sensor networks,” in Proc. of ICCCN,Scottsdale, Arizona, USA, 2001, pp. 304–309.

[3] R. Shah and J. Rabaey, “Energy aware routing for low energy ad hocsensor networks,” in Proc. of IEEE-WCNC, Orlando, Florida, USA,2002.

[4] H. K. Dai and Y. Du, “On the bounded-delay minimum-cost pathproblem of quality-of-service routing,” in Proc. of ICCCN, Honolulu,Hawaii, USA, 2007, pp. 384–390.

[5] L. Tu, H. Hong, and G. Zhou, “Minimum cost routing with a lifetimeguarantee in wireless sensor networks,” in Proc. of GreenCom-CPSCom,Hangzhou, China, 2010.

[6] W. An, H. Luo, S. Ci, and J. Lin, “The maximized relay capacity andoptimal data transmission for wireless sensor networks,” in Proc. ofIEEE GLOBECOM, Houston, Texas, USA, 2011.

[7] Q. Gao, K. J. Blow, D. J. Holding, I. W. Marshall, and X. Peng, “Radiorange adjustment for energy efficient wireless sensor networks,” Ad HocNetworks, vol. 4, no. 1, pp. 75–82, 2006.

[8] Y. Yao, Y. Fan, and H. Luo, “A distributed relay node placement strategybased on balanced network lifetime for wireless sensor networks,” inProc. of WCNIS, Beijing, China, Jun. 2010, pp. 306–310.

[9] D. Wu, S. Ci, and H. Wang, “Cross-layer optimization for video sum-mary transmission over wireless networks,” IEEE Journal on SelectedAreas in Communications, vol. 25, no. 4, pp. 841–850, 2007.

[10] E. Malkamaki and H. Leib, “Performance of truncated type-II hybridARQ schemes with noisy feedback over block fading channels,” IEEETransations on Communications, vol. 48, pp. 1477–1487, Sep. 2000.

[11] C. H. Papadimitriou and K. Steiglitz, Combinatorial Optimization:Algorithms and Complexity. Dover Publications, Inc., 1998.

[12] R. K. Ahuja, T. L. Magnanti, and J. B. Orlin, Network Flows: Theory,Algorithms, and Applications. Prentice-Hall Inc., New Jersey, 1993.

6018

2013-06655562

Documents

Transcript of 2013-06655562