Taming Wireless Fluctuations by Predictive Queuing Using a Sparse-Coding Link-State Model
Stephen J. Tarsa
Harvard University Cambridge, MA

Marcus Comiter
Harvard University Cambridge, MA

Michael B. Crouse
Harvard University Cambridge, MA

Bradley McDanel
Harvard University Cambridge, MA

H. T. Kung
Harvard University Cambridge, MA

ABSTRACT
We introduce State-Informed Link-Layer Queuing (SILQ), a system that models, predicts, and avoids packet delivery failures caused by temporary wireless outages in everyday scenarios. By stabilizing connections in adverse link conditions, SILQ boosts throughput and reduces performance variation for network applications, for example by preventing unnecessary TCP timeouts due to dead zones, elevators, and subway tunnels. SILQ makes predictions in real-time by actively probing links, matching measurements to an overcomplete dictionary of patterns learned oﬄine, and classifying the resulting sparse feature vectors to identify those that precede outages. We use a clustering method called sparse coding to build our data-driven link model, and show that it produces more variation-tolerant predictions than traditional loss-rate, location-based, or Markov chain techniques. We present extensive data collection and ﬁeld-validation of SILQ in airborne, indoor, and urban scenarios of practical interest. We show how oﬄine unsupervised learning discovers link-state patterns that are stable across diverse networks and signal-propagation environments. Using these canonical primitives, we train outage predictors for 802.11 (Wi-Fi) and 3G cellular networks to demonstrate TCP throughput gains of 4x with oﬀ-the-shelf mobile devices. SILQ addresses delivery failures solely at the link layer, requires no new hardware, and upholds the end-to-end design principle to enable easy integration across applications, devices, and networks.
Davis'Sq.' Alewife' Porter'Sq.' Harvard'Sq.'

Urban&Subway&Scenario&
Inbound'
Wireless&Environment&
Alewife' Porter'Sq.' Harvard'Sq.' Davis'Sq.'

Outbound(

Central'Sq.'

Above?Ground'
Link%Loss%≤%10%%
Charles/MGH'

Central'Sq.'

Above?Ground'

Kendall'Sq.'/'MIT'

10%%<%Link%Loss%≤%90%% Link%Loss%>%90%%

Kendall'Sq.'/'MIT'

Charles/MGH'

Figure 1: Packet loss rates over a 3G cellular network are shown on the subway in Boston, MA. Temporary dead zones in tunnels cause connection timeouts that leave links under-utilized, even when the train comes above-ground.

Categories and Subject Descriptors
C.2.1 [Network Architecture and Design]: Wireless networking, 802.11 (Wi-Fi), cellular data networks, TCP, sparse coding, data-driven model learning

1. INTRODUCTION
Wireless networks struggle to cope with link ﬂuctuations in everyday scenarios. Any user who has accessed in-ﬂight
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for proﬁt or commercial advantage and that copies bear this notice and the full citation on the ﬁrst page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior speciﬁc permission and/or a fee. Request permissions from permissions@acm.org. MobiHoc’15, June 22–25, 2015, Hangzhou, China. Copyright c 2015 ACM 978-1-4503-3489-1/15/06 ...$15.00. http://dx.doi.org/10.1145/2746285.2746318.

Wi-Fi, checked a smart phone on an elevator, or browsed the web on a train can attest to frustrating pockets of dismal throughput. Such disruptions result from the drastic changes in signal propagation that occur as users travel through complex environments with varying sources of multipath reﬂection, distances to base stations, line-of-sight occlusions, and interference. Generous provisioning of network infrastructure is the de-facto solution in practice, but more ﬂexible and scalable strategies are needed [1]. Addressing delivery failure at the link layer is attractive because it shields higher layers of the network stack from transient link variations without changing radio hardware [2, 3]. When packet delivery is stable, applications and protocols like TCP realize higher throughput with lower variation, simplifying design and improving user experience [4]. However, predicting wireless link state at the scale of individual packets is diﬃcult in real-world scenarios where steady-state assumptions like stable loss rates or location-dependence have limited eﬃcacy. For example, we see in indoor experiments that the relationship between physical location and packet loss can change without warning if a coworker politely holds open a metal door, extending a link’s range. These eﬀects are common, especially in environments where construction predates pervasive wireless communication. In response, we present State-Informed Link-Layer Queuing (SILQ), a system that uses data-driven learning to mitigate the destabilizing eﬀects of wireless link loss. SILQ predicts link state at the 100ms time-scale to anticipate tempo-

rary outages and buﬀer packets accordingly. Real-time predictions are made by actively probing links, matching the resulting measurement sequences to an overcomplete dictionary of prominent patterns learned oﬄine, and then classifying these sparse feature vectors to determine if they precede an outage. By preempting transmissions that are unlikely to succeed, SILQ reduces performance degradation caused when higher layers of the network stack react adversely to missing packets, for instance when TCP times out in a subway tunnel. Unlike prior cross-layer approaches to this problem, SILQ does not break connections in the middle of the network or modify transport protocols [4, 5, 6]. SILQ’s prediction model improves upon past data-driven link models by expressing state as a linear combination of a few canonical packet-loss patterns, called features. This sparse linear model cuts away noise and restores missing data to improve statistical accuracy in the face of real-world data variations [7]. More-stable feature vectors can then be classiﬁed with a linear Support Vector Machine (SVM) to identify outages [8]. This method produces stable predictions looking further ahead in time than standard techniques like regression [9]. We demonstrate that this noise-, variation-, and deletion-tolerant model is computationally simple enough for real-time operation on mobile devices, and also supports the design ﬂexibility to trade prediction accuracy for bandwidth and power improvements. We build our predictive link model by ﬁrst learning a dictionary of link-state features oﬄine. Training data consists of UDP packet receptions recorded in three scenarios: • An Unmanned Aerial Vehicle (UAV) ﬂying over rural farmland • Users walking throughout an active oﬃce building and riding an elevator • Users traveling through a city on the subway We show that features captured by sparse coding, a form of unsupervised clustering [10], reﬂect the eﬀects of line-ofsight occlusions, radio range limitations, and interference. Furthermore, we show that features are often universal across rural, urban, indoor, outdoor, high mobility, and low mobility links. In contrast, we ﬁnd that our supervised SVM outage-predictors are speciﬁc to an environment, though predictions remain accurate over days, weeks, and months. This paper presents a full implementation of SILQ on oﬀ-the-shelf Linux and Android devices, and validates it in the ﬁeld over 802.11b/g and 3G cellular networks. We compare our sparse-coding link model to traditional modeling approaches based on loss-rate and heuristics in terms of both statistical accuracy and network throughput under predictive queuing. Ultimately, we show that SILQ boosts throughput in an indoor oﬃce by 2x when used with Linux TCP, while achieving 4x lower variation. SILQ further improves throughput by 4x on the Boston subway where 3G coverage is spotty. Finally, we show that the power overhead of our approach can be reduced to only 4-5% above a baseline data connection by updating predictions less frequently when the link is in a steady state.

Airborne	  Scenario	  	  

Forest	  	   Nodes	  

Open-­‐Field	   Nodes	  

UAV	   	  Node	  

Ground-­‐	  	   Structure	   Nodes	  

Open-­‐Field	  Node	  	   Loca4on	  

Forest	  Node	  	   Loca4on	  

Ground-­‐Structure	  	   Node	  Loca4on	  

Figure 2: Our UAV’s ﬂight path over ﬁelds and farmland in upstate New York is shown. Ground receivers are placed in three locations to capture the link characteristics of an open ﬁeld, dense forest, and parking lot with surrounding ground structures.

probe packets every 1ms from a mobile node to a nearby base station that logs sequence numbers. We detect lost probes from non-sequential receptions and construct contiguous streams of probe deliveries, called “traces.” These high-granularity traces are used to build a link model in Section 4, though we introduce an adaptive probing mechanism in Section 5.1 to reduce SILQ’s runtime probing overheads. In our experiments, transmitters and receivers are all oﬀthe-shelf mobile devices. For 802.11 measurements in the UAV and indoor oﬃce scenarios, we use an array of laptops, Mobile Internet Devices (MIDs), and BeagleBone Blacks depending on weight, power, and portability restrictions. Each accesses the wireless medium using a T-Link TL-WN727 USB dongle, and we disable hardware retransmissions by sending packets in broadcast mode. For subway experiments, we access the 3G cellular network with an Android smartphone or by tethering nodes to an Apple iPhone 6. In this case, we cannot disable ARQ with cell towers, though the eﬀect is constant throughout data. Given the speed and range of our UAV links, we transmit at 1Mbps modulation for best symbol resilience. In order to compare results across environments, we rate-limit links in the oﬃce and subway to 1Mbps. In practice, SILQ has been scaled to 20Mbps and deployed on commercial 802.11, 3G, and LTE cellular networks.

2.1 UAV Air-to-Ground Communication
Our UAV scenario consists of a ﬁxed-wing SIG Rascal 110 ﬂying a dumbell shaped pattern over an airﬁeld in upstate New York on precipitation-free days, as shown in Figure 2. Autopilot maintains a consistent ﬂight path to within 15m across laps, while keeping altitude steady at 100m and velocity at 20m/s. The property spans forest and farmland, contains fewer than 50 ground structures or cars at any time, and is free of 802.11 interference. Data collection is conducted over 802.11b with a single broadcast transmitter attached face-down to the UAV’s wing.

2. TEST SCENARIOS
We begin by describing our UAV, indoor oﬃce, and urban subway scenarios. In each environment, we collect initial link measurements by transmitting 66-byte (66B) UDP

Indoor	  Oﬃce	  Scenario	  with	  Elevator	  
Start/Finish	  
Access	  Point	  	   Environment	  

also observe that signals are powerful enough to reach several ﬂoors to the basement, but that links can be blocked at ranges as short as 15m should ﬁre doors occlude transmission. For this scenario, our data includes cross-channel interference from the building’s 802.11 network.

Access	  Point	   Elevator	   Fire-­‐Proof	   Doors	   2nd	  Floor	   1st	  Floor	   Ground	  Floor	   Basement	  

2.3 Subway
Our subway scenario is shown in Figure 1. A user travels between Cambridge and Boston, MA in a train car that passes both below and above ground. Train speed varies depending on track conditions, and precise positioning is unknown within tunnels. The user transmits data from a mobile device to a server accessed via a ﬁrst-hop 3G cellular connection and subsequent hops over wired Internet. We observe that round trip delays average 150ms and ﬂuctuate within a manageable distribution. Queuing delays cause UDP probe packets to arrive in bursts of roughly consistent length and spacing. As illustrated in Figure 1, we observe many regions of poor reception, but also extended regions of excellent reception when 3G repeaters are nearby or the subway crosses an above-ground bridge. This scenario is completely uncontrolled and is used to validate SILQ’s methodology in an everyday situation of particular aggravation to the authors.

Fire-­‐Proof	  	   Doors	  

2nd	  Floor	  Elevator	  

Figure 3: Our walking tour throughout an active oﬃce is shown in blue. Links are attenuated by concrete and brick, and obstructed by ﬁre-proof doors and an elevator car. Ground receivers are placed in identical orientations in three groups throughout the property to capture diﬀerent signal-propagation characteristics: 1. Field nodes at the end of the runway have clean line-ofsight to the UAV during most of the ﬂight. For them, packet loss is driven primarily by radio range. 2. Ground structure nodes are placed in a parking lot partially surrounded by several 5m-tall metal buildings. Between 4 and 6 cars are parked within 10m of nodes, and vehicle positions change across ﬂights. These structures create line-of-sight occlusions and multipath reﬂections that aﬀect packet delivery depending on geometry and UAV position. 3. Forest nodes are placed in a grove of ﬁr trees at the end of the runway. They capture similar link conditions to ﬁeld nodes, though uneven ground, trees, branches, and leaves aﬀect line-of-sight transmission. This UAV scenario is of both experimental and operational interest. First, the simple interference-free environment is ideal for conducting controlled wireless experiments to serve as building blocks for more complicated scenarios. Second, UAV wireless communication has commercial applications in bridge and crop inspection, rural Internet relay, package delivery, and a myriad of military scenarios.

3. PREDICTIVE LINK-STATE MODELING
In order to implement predictive queueing, we ﬁrst build a model of wireless packet delivery for our three scenarios. In the literature, link models typically reﬂect two approaches: physical models that use signal propagation to explain packet loss, and data-driven models that mathematically capture those eﬀects with statistics like correlation between packet receptions. The former group includes distance-based attenuation, two-ray interference, and geometric occlusion models [11, 12, 13], while the latter includes loss-rate and Markov-chain models that reproduce distributions of consecutive receptions [14, 15, 16]. Though physics-driven methods are intuitive, they require detailed environment-speciﬁc information as inputs. On the other hand, generic data-driven models can require training datasets that grow exponentially with temporal scale. These two extremes are sometimes blended by locationbased statistical models [17, 18]. However, we ﬁnd that location information is generally a poor predictor of individual packet losses. For example, we observe in our UAV experiments that slight changes to antenna orientation drive major swings in link loss. We also see that the location of a single car close to ground-structure nodes causes drastic changes in packet loss behavior due to multipath reﬂection. Indoors, we see that loss characteristics change when doors are closed or metal objects are moved within oﬃces. And in the subway, location information is not even available once the train descends underground. For these reasons, we purposefully eschew location information in pursuit of a more accurate, general packet loss model. Our data-driven link model combines unsupervised sparse coding, a form of clustering, with supervised Support Vector Machine (SVM) classiﬁcation. This approach is closely related to state-of-the-art learning systems for vision and speech recognition that capture complicated features while tolerating non-Gaussian data variations [19]. Though ours is the ﬁrst sparse-feature model for wireless packet transmission to our knowledge, other learning techniques are gaining

2.2

Indoor Ofﬁce

Our indoor oﬃce scenario is shown in Figure 3. A user carries a laptop throughout an oﬃce building while transmitting data to an 802.11 access point in a second-ﬂoor oﬃce. For each experiment, the user repeats a walking tour that crosses a footbridge to a nearby building and then returns for an elevator ride down three ﬂoors to the basement. All experiments are conducted during high-traﬃc daytime hours when the building is used by several hundred students and faculty. We observe strong ﬂuctuations in link quality due to signal attenuation through brick and concrete, as well as blockages from ﬁre-proof doors and the elevator car. We

traction in networking scenarios. At the transport layer, game-theoretic simulations can discover TCP protocols tailored to network conditions [20]. At the physical layer, learning has been used to adapt forward-error-correction (FEC) to congestion in simulation [21]. Our approach stands out both by our use of sparse feature representation to cope with real-world measurement artifacts, as well our emphasis on live, online operation aboard mobile devices.

3.1

A Sparse Coding Link Model

Sparse coding is an unsupervised clustering method that solves for an overcomplete matrix of recurring patterns in training data. We call that matrix a dictionary and its column vectors features [10, 22]. Under this framework, a data vector can be approximated as a linear combination of features. By constraining the number of features used, approximations are represented by sparse coeﬃcient vectors with only a few non-zero entries. For data with noise or non-Gaussian variations like deletions, sparse approximations latch measurements to exemplar patterns, truncating weakly expressed information and restoring missing values. This process improves variation-tolerance in subsequent statistical analysis.

of as a data-driven method for reducing the space of possible states in the model. For example, if we train a dictionary with many features and a small penalty parameter (e.g. m ≈ 200 and λ ≈ 0.1), then D captures templates of frequently occurring link states. New link measurements are then matched to the closest template by setting k = 1. However, if the number of dictionary features is chosen to be small and lambda is chosen to be large (e.g. m ≈ 20 and λ ≈ 4), then training will extract a small number of the most prominent patterns in each training example. This produces a set of more-general features that can be combined to enumerate observed link states, for example by encoding with k = 8. We call these features primitives. In Section 4, we will show that these latter linear-combinational models generalize well across networks and environments, addressing a key limitation of traditional wireless link models.

3.2 Outage Prediction With SVMs
Using learned link primitives, we can build a predictive statistical model for all sorts of link-layer events that affect network performance. In this work, we use bulk data transport via TCP as an application vehicle to demonstrate SILQ’s gains and deﬁne the prediction target to be any upcoming sequence of 192 probe measurements containing a single delivery failure. This represents back-to-back transmission failures of 1500B MTU data packets sent at 1Mbps with the Linux default 7 ARQ retries, assuming a roundtrip time less than 100ms. Since most TCP implementations can recover quickly from a single packet drop using fast-retransmit, we instead try to detect these two-packet outages. We note that video streaming and transaction processing are other candidate applications for SILQ with potentially diﬀerent prediction targets. To implement binary event prediction, we use a linear SVM classiﬁer, which ﬁnds a separating plane between training data belonging to two classes, designated by class labels; later, new data vectors are compared to the separating plane using an inner-product and threshold test to predict their class membership. Since SVM training data in our application requires both sparse feature vectors and corresponding labels, it is considered a supervised technique. To derive labels yt for our data, we annotate a subset of traces from each environment, setting yt = 1 when an outage followed the measurement vector in the trace and yt = 0 otherwise. In Section 4.2, we show how to optimize over SVM conﬁguration parameters.

3.1.1

Ofﬂine Dictionary Training

We learn dictionaries oﬄine from a large set of link traces, eﬀectively exploiting past data to improve statistical accuracy for future data. Several formulations for dictionary training exist [23]; we use an optimization method based on the Least Absolute Shrinkage and Selection Operator (LASSO) that relaxes sparse coding’s constraint on the number of non-zero feature coeﬃcients by instead penalizing the sum of coeﬃcient magnitudes [24]: min
α,D

1 ||x − Dα||2 + λ||α||1 2 2

(1)

where D is a w × m dictionary containing m features of length w, x a normalized binary measurement vector representing the outcomes of w back-to-back probe transmissions, α its sparse feature representation, and λ a tunable positive value. To implement dictionary training, we segment ﬁeld traces into T windows of w measurements xt for t = 1...T . This objective produces qualitatively similar results to sparse coding but is convex, yielding stable, accurate solutions for D.

3.1.2

Online Measurement Encoding

While LASSO is eﬀective for oﬄine dictionary learning, it is too computationally intensive to compute sparse codes in real-time on mobile devices. Therefore, we use a D found by LASSO and solve for α online according to: min ||x − Dα||2 s.t. ||α||0 ≤ k 2
α,D

4. TRAINING MODELS 4.1 Learned Link Primitives
We ﬁrst train dictionaries on ﬁeld-collected traces using LASSO and examine the link primitives discovered in each of our scenarios. We learn separate dictionaries of size m = 20 with λ = 4 from each type of receiver: UAV ﬁeld nodes, UAV forest nodes, UAV ground-structure nodes, indoor ofﬁce nodes, and subway nodes. Figure 4 shows several features found often in each trace, with dips reﬂecting patterns of delivery failure and peaks capturing delivery successes. We see that primitives learned across UAV ground receiver types are similar. Taken together, they capture examples of abrupt changes to the link, gradual transitions, intermittent packet drops, and temporary outages lasting 10’s of milliseconds. When we examine the distributions of

(2)

with k a hard sparsity threshold. This alternative formulation can be greedily solved by Matching Pursuit (MP) [25] using a series of inner-product tests that iteratively select k features to reduce a residual. MP has O(mw) complexity and we show in Section 6.4 that feature encoding is a negligible part of SILQ’s running time. Our sparse-feature-based link model is related to state models like Markov chains in the literature [14, 16]. The process of learning a dictionary by clustering can be thought

nonzero coeﬃcients in each trace’s sparse representations, we ﬁnd that ﬁeld and forest nodes more often map to long runs of delivery successes or smooth gradual changes in link quality. This aligns with our experiment conﬁguration, in which these nodes have the best connectivity, are far from any structures that can suddenly occlude line-of-sight transmission, and are aﬀected mostly by radio range limitations. In contrast, the coeﬃcients computed for ground-structure nodes more-often map to abrupt transitions as buildings block line-of-sight until the UAV is immediately overhead. Primitives learned in the oﬃce environment bear a striking resemblance to those learned from UAV receivers. We see a mixture of temporary outages aﬀecting strong links, as well as abrupt on-to-oﬀ and oﬀ-to-on transitions. These links exhibit a greater degree of noisy variation than UAV nodes, possibly due to 802.11 cross-channel interference from the building’s wireless network. Subway atoms reﬂect the same general shapes, but prominently exhibit regular 10ms-long oscillations in delivery. According to log ﬁles, average probe inter-arrival times over the 3G network display 10ms bursts of back-to-back deliveries, likely due to queueing delays. This means that unsupervised learning captures primitives reﬂecting both wireless-link and network eﬀects.
UAV (car) UAV (field) Subway Link-­‐State	  Primi>ves	  BElevator y	  Environment	  

Link-­‐State	  Template	  from	  Subway	  Environment	  

Figure 5: A link-state template learned from subway data is shown. The eﬀects of link loss and bursts due to network delays are captured together, contrasting with Figure 4 where they are separated into diﬀerent primitives.
TCP	  with	  Outage	  Predic,on	  	   By	  Detector’s	  Receiver	  Opera/ng	  Characteris/c	   	   Op#mal	   Conserva#ve	  
•  Low	  Risk	  of	  Timeout	   •  High	  Sending	  Rate	   	  
Aggressive	  

1	   True	  	   Posi,ve	   Rate	   	   (Successful	  	   Outage	  	   Detec/ons)	  

•  Low	  Risk	  of	  Timeout	   •  Low	  Sending	  Rate	   	  
Poor	  

•  High	  Risk	  of	  Timeout	   •  High	  Sending	  Rate	   	  
0	  

•  High	  Risk	  of	  Timeout	   •  Low	  Sending	  Rate	   	  
1	  

0	   False	  Posi,ve	  Rate	   (Unnecessary	  Queuing)	  

UAV	  Ground-­‐Structure	  

UAV	  Field	  

Indoor	  Oﬃce	  

Subway	  

Figure 4: Common link primitives learned from different environments using LASSO with m = 20 and λ = 4 are shown. They capture fast and slow link transitions as well as bursts of delivery successes.

Figure 6: The eﬀect of outage-detection sensitivity on TCP is illustrated. Detectors that forward data aggressively run a high risk of timeout. Those that forward conservatively rarely time out, but often under-utilize the link. beyond just their training data, reducing the need for extensive data collection in new scenarios.

4.1.1

Separating Network Effects

As discussed in Section 3.1, sparse feature models can be conﬁgured either to match link states to a single template, or to express them as linear combinations of primitives. The practical result of this choice can be seen by comparing the primitives in Figure 4 to the template in Figure 5 learned from subway data using m = 200 and λ = 0.1. We see that templates simultaneously represent the concurrent eﬀects of wireless link loss and network queuing delays. This contrasts with models based on linear combinations of primitives that automatically separate these eﬀects. This observation highlights the power of applying sparse coding to link modeling: our approach naturally expresses complicated network measurements in a simple, well-separated vector of high-level information – statistics can then be computed relative to this meta-information. In addition to supporting stable, intuitive statistical models, our method uses data-driven learning to identify link states that generalize

4.2 Tuning an SVM Predictor for Temporary Link Outages
We next show how to tune a linear SVM to predict outages from sparse-coded link measurements. Fundamentally, SVMs separate classes of data by solving for a max-margin separator between training examples of each class. When used for detection, this allows us to obtain a tradeoﬀ between true positive and false positive rates by shifting the SVM separator toward one class or another. Within SILQ, a predictor’s sensitivity ultimately aﬀects realized network performance. Figure 6 illustrates this relationship in the context of an outage predictor and TCP throughput. Predictors tuned to forward packets aggressively raise few alarms, achieving low false positives but also low true positives. The result is a fast sending rate but a high risk of TCP timeout when outages are not well-predicted. At the other end of the spectrum are predictors tuned to forward data conservatively by raising many alarms, leading to

	  With	  Throughput	  for	  the	  Indoor	  Oﬃce	  Scenario	  
Most	  Conserva+ve	  

ROC	  as	  a	  Func@on	  of	  Sparsity	  

1 0.9
False Positives True	  Posi@ve	  Rate	  
More	  Conserva+ve	   Avg:	  544.5	  kbps	   Std:	  	  	  43.4	  kbps	   Op+mal	   Avg:	  709.0	  kbps	   Std:	  	  	  35.9	  kbps	  

Avg:	  174.8	  kbps	   Std:	  	  	  45.2	  kbps	  

0.8 0.7 0.6 0.5 0

Aggressive	   Avg:	  410.0	  kbps	   Std:	  213.8	  kbps	  

k	  =	  4	   k	  =	  2	   k	   	   =	  1	   	  
Raw	  TCP	  Baseline	   Avg:	  322.6	  kbps	   Std:	  	  	  83.0	  kbps	  

0.1

0.2 0.3 0.4 True Positives False	  Posi@ve	  Rate	  

0.5

Figure 7: Throughput is shown at various points on our outage detector’s ROC proﬁle, empirically demonstrating the relationship described in Figure 6 in our indoor oﬃce scenario. We also see that training parameters such as encoding sparsity k affect the ROC proﬁle of our outage detector.

link characteristics that are common across diverse environments. For Figure 8, we learn primitives from a single scenario and use them to compute sparse feature representations and train outage predictors in a diﬀerent scenario. We not only see that primitives generalize well across environments, but that some amount of data diversity actually improves outage recall, possibly serving as a defense against overﬁtting. We note that this trend does not hold when we use a template model – for example, we ﬁnd that templates from the oﬃce scenario do not accurately characterize the queuing eﬀects of the subway, hurting recall. We also found that SVM separators do not port as well as dictionaries across scenarios, since the precise relationships between link states over time are environment-dependent. For example, a fast subway train may produce shorter temporal correlation between a degrading link and an upcoming outage than in the oﬃce scenario. However, we saw that SVMs trained on months-old data still provided good performance in the ﬁeld, suggesting that such relationships are fundamentally stable. Since SVM training is less computationally intensive and requires less data than dictionary training, SILQ implements this function on-device to allow fast adaptation to new environments.

a high true positive rate but also many false positives. These detectors rarely let timeouts happen, but severely underutilize links in good conditions. In Figure 7, we show the empirical eﬀect that prediction sensitivity has on throughput, measured in our indoor oﬃce scenario with the methodology of Section 6.2. We see that aggressive models achieve poor average throughput with a high standard deviation – while they opportunistically grab all available sending opportunities, they also often time out and cause throughput to drop to zero for a long period of time. However, overly conservative models end up realizing lower throughput since they fail to send often enough when the link is usable. We also see in Figure 7 that optimizing over sparse coding parameters like m, λ, and k changes the resulting SVM predictor’s ROC proﬁle. For example, if we set m = 20, but k = 2, then the detector is strictly worse than setting k = 4. Guided by these results, we reason that the best link model in terms of TCP throughput should maximize correct outage predictions while ensuring plenty of sending opportunities. We implement this by choosing the model that maximizes outage recall, provided non-outage (i.e. sending opportunity) recall is above a threshold of 0.75. Varying m, λ, k, and the SVM’s test threshold, we train dictionaries and SVMs while holding out a subset of traces for evaluation to obtain m = 20, λ = 4, and k = 4 as our best conﬁguration. Examining our optimized link model, we notice that the SVM heavily weights three particular dictionary primitives, suggesting that TCP throughput is highly sensitive to a small number of link states.

Outage-­‐Predic-on	  Recall	  when	  Transferring	  Primi-ves	  	  
SVM)Training)&)Test Environments
UAV Indoor)Office Urban)Subway UAV Indoor)Office Urban)Subway

Dict.)Training) Environment

0.81 0.83 0.81

0.82 0.79 0.81

0.74 0.79 0.73

Figure 8: Outage-prediction recall is reported when dictionary primitives are learned in one scenario and applied in another. This establishes that learned link primitives are often universal across environments and networks.

5. SILQ ARCHITECTURE
To demonstrate throughput gains from the predictions of our sparse-coding link model, we implement SILQ for Linux and Android devices. At runtime, SILQ nodes each measure their outgoing links by transmitting UDP probes at a regular interval. Reception reports are then bounced back to senders, piggybacked on probes traveling in the opposite direction. After a regular measurement interval (100ms for all results in this paper), each SILQ node computes a link prediction using the prior w measurements. If an outage is predicted, SILQ holds all data packets at the link layer until a new prediction indicates otherwise.

5.1 Measurement Protocol
SILQ’s measurement protocol is shown in Figure 9. Nodes maintain a bitmap Linkin marking the last w incoming probes relative to the most recent, denoted Seqin . An arriving probe’s sequence number is compared against Seqin to detect delivery failures before updating Linkin and Seqin . Our UDP probes use 69B to carry a reception report consisting of Linkin as a 3B bitmap and Seqin as a 2B unsigned short. Nodes also maintain a round-trip-time estimate dRTT to detect lost reception reports. For example, if Seqout = 20

4.3

Transferring Features

While some link-state primitives may be unique to a particular environment, Section 4.1 showed that similar features are found among scenarios. Intuitively, these should represent common drivers of link loss like range eﬀects in large open environments or abrupt transitions due to occlusions. We next empirically verify that learning captures canonical

Link	  Measurement	  Protocol	  
SeqOut 0! SeqOut 1! SeqOut 2! SeqOut 3!

Seq 0 ! Seq 1 !
nk In 0, Li Seq In=

x	  
= [ 1]!

SeqIn= 0   LinkIn= [

1]!

report to bounce back. For example, TCP uses a tripleduplicate-ACK to trigger fast recovery, so we set PMin = 3. With dRTT = 110ms, dTX = 12ms, and a sending queue of QSILQ = 50 packets, we can set tsleep = 454ms. Case 2: Bad-to-Good Transition When the link is unavailable, we can save power by turning the wireless radio oﬀ and waking periodically to check link status. In this case, a large tsleep risks underutilizing the link by slowing SILQ’s reaction time. We tune this quantity empirically – for example we see an average 3s delay between restored connectivity and SILQ’s ﬁrst predicted sending opportunity in our oﬃce scenario and conservatively set tSleep = 900ms to ensure that the link is checked every 1s.

Lost	  Report	   Detec2on	  

Recv:	  	  SeqIn= 0

State	  Update	  
LinkIn= [ 1]!

SeqIn= 2   LinkIn= [101]! SeqIn= 3   LinkIn= [011]!

E[SeqIn]= SeqOut– dRTT*1! ! ! = 3 – 2!= 1	   LinkOut = [ ! ! 10]
!

Seq 4 ! Seq 5 !

x	  

SeqOut 6! SeqOut 7!

Lost	  Report	   Detec2on	  

Recv:	  	  SeqIn= 0 !

State	  Update	  

[101] nk In = 5, Li Seq In=

SeqIn= 5   !LinkIn= [101]!

x	  

LinkIn= [ 1]!

E[SeqIn]= SeqOut– dRTT*1! = 7 – ! != 5	   2
! LinkOut = [000]! !

SILQ	  So=ware	  Architecture	  
SILQ	   Network	   Applica/on	  
Data	  Channel	  
SILQ::dataChan	  
Link	  State	   Updates	  

Figure 9: SILQ’s measurement protocol piggybacks reception reports containing a bitmap of incoming probe receptions on outgoing probes. Senders use estimated round-trip-time dRT T to detect lost reception reports. probes are sent and only the ﬁrst report successfully bounces back, the sender must compute how many reports should w have arrived: Seqout − dRTT 100ms = 19. This means that bidirectional probing captures asymmetric link eﬀects, except when many reception reports are lost at the end of an interval. In this case, we make the conservative assumption that the link is oﬀ in both directions. In practice, we ﬁnd that disruptions due to dRTT ﬂuctuation resolve after one or two missed 100ms windows, even on a crowded carrier network.

Control	  Channel	  
SILQ::ctrlChan	  

Predic7ons	  

FSM	  
Link	  State	  Bitmap	  

‘Forward’	  	  

[	  Outgoing	  Packet	  Queue	  ]	  

ne?ilter_queue	  

User	  space	   Kernel	  space	  

Incoming	   Data	  Packets	  

Outgoing	   Data	  Packets	  

Outgoing	  	   Incoming	   Probes	   Probes	  

Wireless	  Medium	  

Figure 10: The SILQ architecture consists of logically separated data and control channels that isolate link measurement protocols from data forwarding protocols. In our testbed, both access the same wireless medium using the same interface.

5.1.1

Managing Probing Overheads

Probing consumes both power and network bandwidth, resources that must be balanced against measurement ﬁdelity. For example, in Section 6, Figures 12 and 14 show us that sending 20 probes per 100ms consumes 22% of bandwidth for a 1Mbps link and 9% additional battery for a laptop with a power-hungry USB radio. Beyond simply reducing probe rate, we introduce a strategy for managing overhead by updating predictions lessfrequently when the link is in a stable steady-state. This steady-state sleep mode detects sustained strong-connectivity or sustained outage by 3 consecutive all-1 or all-0/missing link reports, respectively. In this mode, SILQ updates predictions every tSleep ms using a single round of probing. We set tSleep so that regular prediction will resume in two cases: Case 1: Good-to-Bad Transition When a good link degrades, outgoing data will be vulnerable to loss due to a stale state prediction. We therefore compute the portion of SILQ’s sending queue at risk and set tSleep to ensure tolerable loss: tSleep = dTX ∗ (QSILQ − PMin ) − dRTT (3)

5.2 Software Architecture & Implementation
SILQ’s software architecture is shown in Figure 10. It is implemented in user space for Linux and Android devices using netfilter_queue to administer packets within the kernel. Each node runs both data channel and control channel controllers – the data channel controller holds or forwards packets based on the current predicted link state, while the control channel controller is responsible for probing and providing measurements to a ﬁnite-state machine (FSM) linkmodel module. The logical separation between channels simpliﬁes probing and data-delivery protocols, however both access the same wireless medium using the same wireless interface for all results in this paper. SILQ’s FSMs are modular, allowing us to compare sparse coding to traditional modeling approaches in the ﬁeld. We implement NO_OP (i.e. always-forward), Loss Rate Threshold, Heuristic Hold, and Sparse Coding FSMs. For all but the NO_OP FSM, trivial all-0 measurement vectors always predict outage, and trivial all-1 measurement vectors predict no-outage. For Loss Rate Threshold and Heuristic Hold, we use a “conservative” parameter to require back-to-back nooutage predictions before turning a bad link back on. This stabilization procedure is needed for loss-rate and heuristic predictors to achieve any throughput gains in the ﬁeld. In contrast, we optimize the sparse coding FSM for stable prediction using the statistical methods of Section 4.2.

where PMin is the minimum number of packets that must be delivered to maintain the connection, QSILQ is the current send-queue size, dTX is the transmission time of an MTU data packet, and dRTT is the time needed for a full link

6.

SYSTEM EVALUATION

We evaluate SILQ in the UAV, oﬃce, and subway scenarios by measuring TCP throughput, power consumption, and CPU utilization. For these experiments, we train dictionaries and SVMs as in Section 4 and then transfer a large ﬁle from a mobile device to a remote server via TCP. We compare raw TCP to SILQ using predictions from sparse coding, loss-rate threshold, and heuristic models. Our heuristic holds data packets if all probes in a previous window are lost, representing the simplest method of coping with the long out-of-range regions in our UAV and subway tests. We empirically tune a loss-rate threshold and “conservative” parameters for comparison methods to maximize throughput. Our baseline TCP protocol is the Linux default TCP Cubic. We enable SACK-enhanced F-RTO for resilience over wireless links, use 9 keep alive probes, and allow 15 back-toback timeouts before terminating a connection. Our wireless adapters use 7 ARQ retries for data packets, while we disable ARQ for probes over 802.11 links. To control for diﬀerences in link quality, we compare overall throughput for diﬀerent experiments using total data transferred divided by the total number of transmission opportunities for data packets. To illustrate the eﬀect of small changes in experiment parameters like walking speed, Figures 11(a) and 11(b) plot loss rate for two runs in our oﬃce scenario using faint green, yellow, and red bars. We see that the durations of both outages and regions with strong connectivity diﬀer, biasing total throughput by aﬀording more transmission opportunities during the run shown in Figure 11(a) independent of network protocol. Our comparison metric corrects for this eﬀect in post-processing by counting only stable regions with back-to-back probe deliveries in our throughput denominator.

Figure 11: TCP throughput is plotted against the left axis in our indoor oﬃce scenario. For reference, link loss is plotted against the right axis using faint green, yellow, and red bars. Link-state predictions are represented by pink and white markers at the top of each plot. SILQ’s optimal, aggressive, and conservative forwarders behave as anticipated, and TCP connections wake up quickly with SILQ.

6.1

UAV Throughput in Emulation

enough to ﬁx isolated losses. In all cases, we found that lossrate and heuristic predictors underperform sparse coding – our heuristic only captures coarse outages lasting hundreds of milliseconds, while loss-rate predictions map both outages and intermittent losses to a single metric.

Due to logistical barriers related to personnel, budget, and weather, we validate throughput in the UAV scenario using a lightweight link emulator. During emulated ﬂights, SILQ runs on our UAV payload and ground nodes, while probe and data packets are sent over wired interfaces to a central controller that replays ﬁeld traces. The emulation controller forwards packets to their destination only if a packet reception was recorded in the ﬁeld trace. We emulate a 1Mbps link and compare 1500B MTU data packets against 12 backto-back 1ms probes to reﬂect a 12ms transmission time. An error in any probe causes the data packet to be dropped. The controller simulates both ARQ and CSMA. Emulated traces are held out of the training datasets used for dictionary and SVM training. We ﬁnd that SILQ boosts throughput by 22% over a raw TCP connection for links to ground-structure nodes. This improvement from 529 kbps to 644kbps is attributed to the high-quality predictions of our sparse coding link model, as heuristic and loss-rate predictions hurt throughput by 5% and 10%, respectively. For ﬁeld and forest nodes, SILQ produces little beneﬁt to average throughput, but yields an extremely low throughput deviation of 19kbps across ﬂights. Examining UAV links by type, we ﬁnd that SILQ’s sparsecoding predictions help most when link conditions vary often. Our learning approach successfully captures measurement sequences with strong predictive power caused by the complex geometry of nearby buildings and cars. In contrast, when loss is intermittent at radio range extremes, ARQ is

6.2 Indoor Elevator/Ofﬁce Environment
Next, we deploy SILQ in our indoor oﬃce environment. For these experiments, wireless links are stable when users are close to an access point, but exhibit both abrupt outages due to ﬁre doors and gradual degradation caused by an elevator car. Averaging ﬁve runs, SILQ improves throughput by 2x over a raw TCP connection, while reducing throughput variation by 4x. Predictions from our sparse coding model produce 15% higher throughput than the heuristic and 50% higher throughput than loss-rate predictions. We found it diﬃcult to tune either comparison prediction-method to perform well over the entire walking tour – unlike sparse-coding, neither could cope with both abrupt link transitions and gradual link degradation using a single conﬁguration. Figure 12 shows that sparse-coding predictions are also more resilient to lower probe rates than heuristic or loss-rate models. In Section 6.4, we show that this allows us to better match SILQ’s power consumption to available resources. Figure 11 shows a closer look at SILQ’s performance over example runs in the indoor oﬃce. For reference, we plot link loss against the right axis using faint green and red bars. When appropriate, SILQ’s link state predictions are shown at the top of a plot, with a pink marker indicating that the link is predicted to be on, and whitespace showing a predicted outage. TCP throughput is plotted against the left axis using black and blue shaded curves.

Eﬀect	  of	  Increasing	  SILQ	  Probe	  Interval	  	  on	   TCP	  Throughput	  
1000 Throughput (kbps) 800 600 400 200 0 4 Max.	  Possible	  Data	  Rate	  	   (A#er	  Probe	  Overhead)	   6 8 10 12 14 Probe Interval 992	  kbps	   16 18 20 Sparse	  Coding	   Heuris0c	  	   Loss	  Rate	  Threshold	  

SILQ	  Power	  ConsumpIon	  in	  WaJs	  
Lenovo	  Thinkpad	  X200	  Laptop	   with	  USB	  Wi-­‐Fi	  
Device	   Baseline	  
	  2.54	   8.59	  

HTC	  One	  (M8)	  Smartphone	   with	  built-­‐in	  Wi-­‐Fi	  
Device	   Baseline	   SILQ	  Fixed-­‐Rate	   Probing	   SILQ	  Steady-­‐	   State	  Sleep	  

SILQ	  Fixed-­‐Rate	   Probing	  

SILQ	  Steady-­‐	   State	  Sleep	  
	  0.49	  

	   	  0.67	  	  1.03	  	  1.19	   	  2.0	   	  0.37	  

12W	   8W	   4W	  
0.56	   0.04	   	  0.21	   0.22	   2.76	   0.15	  	  0.16	   0.17	  

779	  kbps	   845	  kbps	  

995	  kbps	  

5	  

Probes/100ms	  

7	   14	   20	  

5	  

Probes/100ms	  

7	   14	   20	  

Figure 12: Average throughput in our indoor oﬃce scenario is shown as we reduce probe rate. Predictions from a sparse coding model are more resilient to low probe rates than heuristic or loss rate models. In Figure 11(a), we see that TCP times out as soon as link conditions degrade and does not wake up until several minutes later. This causes the mobile device to miss sending opportunities during nearly 40% of the tour. In contrast, Figure 11(b) shows SILQ running with our best-tuned sparsecoding model, which predicts impending outages well and causes TCP to wake up rapidly. We also show SILQ’s performance when our SVM outage predictor is tuned according to the True Positive/False Positive tradeoﬀs discussed in Section 4. We see how the more aggressive forwarder in Figure 11(c) sends often, but times out due to its optimism about link state. In contrast, the conservative forwarder in Figure 11(d) misses large portions of sending opportunities in an eﬀort to avoid timeouts.

Figure 14: In our oﬃce environment, our steadystate sleep mode reduces power consumption to 4% and 5% above an active data connection, depending on device. This method forgoes predictions when the link is in a steady-state. a remote server. On the phone, we have a user request popular web pages at a rate of roughly one site per 4 seconds. For both devices, we measure CPU utilization and battery consumption by averaging over 30 seconds. We observe that SILQ uses only 2% CPU on the Intel chip, and 4% on the smartphone. Less than 1% of SILQ’s runtime is spent computing predictions, with most time spent servicing packets. Using a dictionary of 30 atoms for w = 20 probes per interval, SILQ’s memory footprint is 972KB. Figure 14 shows SILQ’s power consumption for each device. We see that lower probe rates reduce SILQ’s power overhead signiﬁcantly – for example, by 14% on the laptop and 6% on the phone when we drop from 20 to 14 probes per 100ms. We also report savings from our steady-state sleep mode in the indoor oﬃce environment. Since our devices prevent us from easily powering radios on and oﬀ without re-associating with access points, this savings is calculated in post-processing from SILQ’s logs, based on recorded steady-state regions. When we compare SILQ’s battery consumption overhead using the steady-state sleep mode to the power consumed by an active data connection, we see that our method only costs an additional 4% and 5% per device, utltimately for a 2-4x boost in throughput.

6.3

Real-World Subway Environment

Our subway scenario validates SILQ “in the wild”. This scenario exhibits a great deal of uncontrolled complexity – mobile nodes experience fast changes in velocity, sudden occlusions from tunnels, spotty connectivity to 3G repeaters, and competing wireless traﬃc. This scenario serves to demonstrate the strength of SILQ’s end-to-end design since no code alterations or network conﬁguration changes were needed for the system to function. Figure 13 compares raw TCP to SILQ with a sparse coding predictor over subway rides between the Harvard Square and Charles-MGH stops. Note that Figure 13(a) represents an inbound trip from Harvard to Charles-MGH, and Figure 13(b) represents an outbound trip between the two stops in the opposite order. While a raw TCP connection times out for minutes at a time, SILQ utilizes nearly every available sending window and improves throughput by 4x over a six minute train ride.

7. CONCLUSION
This paper presented SILQ, a link layer queuing system that uses machine learning to predict link outages so that packet loss can be avoided. Using sparse coding, we demonstrated that a link model based on combinations of simple primitives makes better use of training data and leads to higher quality predictions than models requiring steadystate assumptions about link quality. SILQ can be easily deployed on Linux and Android devices, and has been demonstrated to achieve up to 4x throughput gains in scenarios of practical interest.

6.4

Proﬁling CPU & Battery Overheads

Finally, we proﬁle SILQ’s performance on two of our experimental devices: a Lenovo X200 laptop running a 1.86GHz Intel Core 2 Duo processor with a T-Link TL-WN722N USB wireless radio, and an HTC One (M8) smartphone running a 2.36GHz Qualcomm Snapdragon 801 ARM processor with onboard Wi-Fi. We run SILQ on each device with only baseline background-process activity. Throughout, screens are kept at their default brightness. On the laptop, we create network data traﬃc by transferring a large ﬁle via TCP to

Acknowledgements
This research was supported in part by gifts from the Intel Corporation. Stephen Tarsa is supported by a fellowship from the Siebel Foundation.

Figure 13: We compare a raw TCP connection to TCP atop SILQ using predictions from a sparse-coding link model during inbound and outbound subway rides. Over a 3G cellular network, raw TCP times out quickly and misses many sending opportunities. With SILQ, timeout is avoided and throughput increases by 4x.

8.

REFERENCES

[1] J. Andrews, H. Claussen, M. Dohler, et al., “Femtocells: Past, Present, and Future,” 2012 IEEE Selected Areas in Communications. [2] H. Balakrishnan, V. N. Padmanabhan, S. Seshan, and R. Katz, “A Comparison of Mechanisms for Improving TCP Performance over Wireless Links,” 1997 IEEE/ACM Trans. on Networking. [3] C. Parsa and J. Garcia-Luna-Aceves, “TULIP: A Link-Level Protocol for Improving TCP over Wireless Links,” in 1999 IEEE Wireless Comms. and Networking Conf. [4] H. Balakrishnan, S. Seshan, and R. Katz, “Improving Reliable Transport and Handoﬀ Performance in Cellular Wireless Networks,” 1995 Wireless Networks. [5] A. Bakre and B. Badrinath, “I-TCP: Indirect TCP for Mobile Hosts,” in 1995 IEEE Distr. Computing Sys. [6] T. Goﬀ, J. Moronski, D. Phatak, et al., “Freeze-TCP: A True End-to-End TCP Enhancement Mechanism for Mobile Environments,” in 2000 IEEE INFOCOM. [7] S. Tarsa and H.-T. Kung, “Hierarchical Sparse Coding for Wireless Link Prediction in an Airborne Scenario,” in 2013 IEEE MILCOM. [8] C. Burges, “A Tutorial on Support Vector Machines for Pattern Recognition,” 1998 Springer Data Mining and Knowledge Discovery. [9] S. Tarsa, A. Kumar, and H.-T. Kung, “Workload Prediction for Adaptive Power Scaling Using Deep Learning,” in 2014 IEEE Int. Conf. on IC Design & Tech. [10] B. Olshausen, D. Field, et al., “Sparse Coding With An Overcomplete Basis Set: A Strategy Employed By VI?,” 1997 Vision Research, Pergamon Press. [11] R. Valenzuela, “A Ray Tracing Approach to Predicting Indoor Wireless Transmission,” in 2003 IEEE Vehicular Technology Conference. [12] J. Andersen et al., “Propagation Measurements and Models for Wireless Communications Channels,” IEEE Comms. Magazine, 1995.

[13] M. Ylianttila, J. M¨kel¨, and K. Pahlavan, “Analysis a a of Handoﬀ in a Location-Aware Vertical Multi-Access Network,” 2005 Computer Networks. [14] E. Gilbert et al., “Capacity of a Burst-Noise Channel,” 1960 Bell Syst. Tech. J. [15] K. Farkas, T. Hossmann, F. Legendre, B. Plattner, and S. Das, “Link Quality Prediction in Mesh Networks,” 2008 Computer Communications. [16] A. Konrad, B. Zhao, A. Joseph, and R. Ludwig, “A Markov-Based Channel Model Algorithm for Wireless Networks,” 2003 Wireless Networks. [17] H.-T. Kung, C.-K. Lin, T.-H. Lin, S. Tarsa, and D. Vlah, “A Location-Dependent Runs-and-Gaps Model for Predicting TCP Performance over a UAV Wireless Channel,” in 2010 IEEE MILCOM. [18] B. Ferris, D. Fox, and N. Lawrence, “WiFi-SLAM Using Gaussian Process Latent Variable Models.,” in 2007 IEEE IJCAI. [19] T.-H. Lin and H.-T. Kung, “Stable and Eﬃcient Representation Learning with Non-Negativity Constraints,” in 2014 Intl. Conf. on Machine Learning. [20] K. Winstein and H. Balakrishnan, “TCP Ex-Machina: Computer-Generated Congestion Control,” in 2013 ACM SIGCOMM Computer Communication Review. [21] H. Seferoglu, A. Markopoulou, U. Kozat, M. Civanlar, and J. Kempf, “Dynamic FEC algorithms for TFRC ﬂows,” 2010 IEEE Trans on Multimedia. [22] J. Mairal, F. Bach, J. Ponce, and G. Sapiro, “Online Learning for Matrix Factorization and Sparse Coding,” 2010 Journal of Machine Learning Research. [23] F. Bach, R. Jenatton, J. Mairal, and G. Obozinski, “Optimization with Sparsity-Inducing Penalties,” 2012 Foundations and Trends in Machine Learning. [24] R. Tibshirani, “Regression Shrinkage and Selection via the LASSO,” 1996 Journal of the Royal Stats. Society. [25] S. Mallat and Z. Zhang, “Matching Pursuits with Time-Frequency Dictionaries,” 1993 IEEE Transactions on Signal Processing.