Person:
Lin, Tsung-Han

Loading...
Profile Picture

Email Address

AA Acceptance Date

Birth Date

Research Projects

Organizational Units

Job Title

Last Name

Lin

First Name

Tsung-Han

Name

Lin, Tsung-Han

Search Results

Now showing 1 - 10 of 12
  • Thumbnail Image
    Publication
    Performance Gains in Conjugate Gradient Computation with Linearly Connected GPU Multiprocessors
    (USENIX Association, 2012) Tarsa, Stephen; Lin, Tsung-Han; Kung, H.
    Conjugate gradient is an important iterative method used for solving least squares problems. It is compute-bound and generally involves only simple matrix computations. One would expect that we could fully parallelize such computation on the GPU architecture with multiple Stream Multiprocessors (SMs), each consisting of many SIMD processing units. While implementing a conjugate gradient method for compressive sensing signal reconstruction, we have noticed that large speed-up due to parallel processing is actually infeasible due to the high I/O cost between SMs and GPU global memory. WE have found that if SMs were linearly connected, we could gain a 15x speedup by loop unrolling. We conclude that adding these relatively inexpensive neighbor connections for SMs can significantly enhance the applicability of GPUs to a large class of similar matrix computations.
  • Thumbnail Image
    Publication
    Achieving High Throughput Ground-to-UAV Transport via Parallel Links
    (IEEE, 2011) Lin, Chit-Kwan; Kung, H.; Lin, Tsung-Han; Tarsa, Stephen; Vlah, Dario
    Wireless data transfer under high mobility, as found in unmanned aerial vehicle (UAV) applications, is a challenge due to varying channel quality and extended link outages. We present FlowCode, an easily deployable link-layer solution utilizing multiple transmitters and receivers for the purpose of supporting existing transport protocols such as TCP in these scenarios. By using multiple transmitters and receivers and by exploiting the resulting antenna beam diversity and parallel transmission effects, FlowCode increases throughput and reception range. In emulation, we show that TCP over FlowCode gives greater goodput over a larger portion of the flight path, compared to an enhanced TCP protocol using the standard 802.11 MAC. In the process, we make a strong case for using trace-modulated emulation when developing distributed protocols for complex wireless environments.
  • Thumbnail Image
    Publication
    Parallelization Primitives for Dynamic Sparse Computations
    (2013) Lin, Tsung-Han; Tarsa, Stephen; Kung, H.
    We characterize a general class of algorithms common in machine learning, scientific computing, and signal processing, whose computational dependencies are both sparse, and dynamically defined throughout execution. Existing parallel computing runtimes, like MapReduce and GraphLab, are a poor fit for this class because they assume statically defined dependencies for resource allocation and scheduling decisions. As a result, changing load characteristics and straggling compute units degrade performance significantly. However, we show that the sparsity of computational dependencies and these algorithms’ natural error tolerance can be exploited to implement a flexible execution model with large efficiency gains, using two simple primitives: selective push-pull and statistical barriers. With reconstruction for compressive time-lapse MRI as a motivating application, we deploy a large Orthogonal Matching Pursuit (OMP) computation on Amazon’s EC2 cluster to demonstrate a 19x speedup over current static execution models.
  • Thumbnail Image
    Publication
    Measuring diversity on a low-altitude UAV in a ground-to-air wireless 802.11 mesh network
    (IEEE, 2010) Kung, H.; Lin, Chit-Kwan; Lin, Tsung-Han; Tarsa, Stephen; Vlah, Dario
    We consider the problem of mitigating a highly varying wireless channel between a transmitting ground node and receivers on a small, low-altitude unmanned aerial vehicle (UAV) in a 802.11 wireless mesh network. One approach is to use multiple transmitter and receiver nodes that exploit the channel's spatial/temporal diversity and that cooperate to improve overall packet reception. We present a series of measurement results from a real-world testbed that characterize the resulting wireless channel. We show that the correlation between receiver nodes on the airplane is poor at small time scales so receiver diversity can be exploited. Our measurements suggest that using several receiver nodes simultaneously can boost packet delivery rates substantially. Lastly, we show that similar results apply to transmitter selection diversity as well.
  • Thumbnail Image
    Publication
    A location-dependent runs-and-gaps model for predicting TCP performance over a UAV wireless channel
    (IEEE, 2010) Kung, H.; Lin, Chit-Kwan; Lin, Tsung-Han; Tarsa, Stephen; Vlah, Dario; Hague, Daniel; Muccio, Michael; Poland, Brendon; Suter, Bruce
    In this paper, we use a finite-state model to predict the performance of the Transmission Control Protocol (TCP) over a varying wireless channel between an unmanned aerial vehicle (UAV) and ground nodes. As a UAV traverses its flight path, the wireless channel may experience periods of significant packet loss, successful packet delivery, and intermittent reception. By capturing packet run-length and gap-length statistics at various locations on the flight path, this location-dependent model can predict TCP throughput in spite of dynamically changing channel characteristics. We train the model by using packet traces from flight tests in the field and validate it by comparing TCP throughput distributions for model-generated traces against those for actual traces randomly sampled from field data. Our modeling methodology is general and can be applied to any UAV flight path.
  • Thumbnail Image
    Publication
    FlowCode: Multi-site data exchange over wireless ad-hoc networks using network coding
    (IEEE, 2009) Kung, H.; Lin, Chit-Kwan; Lin, Tsung-Han; Tarsa, Stephen; Vlah, Dario
    We present FlowCode, a system that exploits network coding at the granularity of traffic flows to facilitate fault-tolerant data exchange in wireless mesh networks. Applications include multi-site data replication in ad-hoc environments such as mesh networks or wireless data centers. By coupling an operand-driven transmission mechanism with a layered network topology, FlowCode allows us to realize the gains of network coding in application systems without a global scheduler. We analyze the resulting gains through modeling and simulation and validate our results on an outdoor testbed of 12 wireless devices. Results indicate that in high loss environments, FlowCode provides the most significant gains from improved fault tolerance over redundant paths.
  • Thumbnail Image
    Publication
    Computing Sparse Representations in O(N log N) Time
    (Signal Processing with Adaptive Sparse Structured Representations (SPARS 2013), 2013) Lin, Tsung-Han; Kung, H.
  • Thumbnail Image
    Publication
    Stable and Efficient Sparse Recovery for Machine Learning and Wireless Communication
    (2014-06-06) Lin, Tsung-Han; Kung, H. T.; Gortler, Steven; Tarokh, Vahid; Adams, Ryan
    Recent theoretical study shows that the sparsest solution to an underdetermined linear system is unique, provided the solution vector is sufficiently sparse, and the operator matrix has sufficiently incoherent column vectors. In addition, efficient algorithms have been discovered to find such solutions. This intriguing result opens a new door for many potential applications. In this thesis, we study the design of a class of greedy algorithms that are extremely efficient, e.g., Orthogonal Matching Pursuit (OMP). These greedy algorithms suffer from a stability issue that the greedy selection approach always make locally optimal decisions, thereby easily biasing and mistaking the solutions in particular under data noise. We propose a solution approach that in designing greedy algorithms, new constraints can be devised by leveraging application-specific insights and incorporated into the algorithms. Given that sparse recovery problems by definition are underdetermined, introducing additional constraints can significantly improve the stability of greedy algorithms, yet retain their efficiency.
  • Thumbnail Image
    Publication
    Localization with Snap-Inducing Shaped Residuals (SISR) - Coping with Errors in Measurement
    (2009) Kung, H.; Lin, Chit-Kwan; Lin, Tsung-Han; Vlah, Dario
    We consider the problem of localizing wireless nodes in an outdoor, open-space environment, using ad-hoc radio ranging measurements, e.g., 802.11. As in other range-based methods, we cast ranging measurements as a set of distance constraints, thus forming an over-determined system of equations suitable for non-linear least squares optimization. However, ranging measurements are often subject to errors, induced by multipath signals and variations in path loss, or even faulty hardware or antenna connectors. Including such potentially large and non-Gaussian errors in the measurement data ultimately produces inaccurate localization solutions. We propose a new method, called snap-inducing shaped residuals (SISR), to automatically identify “bad nodes” and “bad links” arising from these errors, so that they receive less weight in the localization process. In particular, SISR snaps “good nodes” to their accurate locations and gives less emphasis to other nodes. While the mathematical techniques used by SISR are similar to those in robust statistics, SISR’s exploitation of the snap-in effect in localization appears to be novel. We provide analysis on the principle of SISR, and demonstrate a working SISR implementation in field experiments on a testbed of 20 wireless nodes, as well as the superior performance of SISR in simulation with a larger number of nodes.
  • Thumbnail Image
    Publication
    Identifying bad measurements in compressive sensing
    (2011) Kung, H.; Lin, Tsung-Han; Vlah, Dario
    We consider the problem of identifying bad measurements in compressive sensing. These bad measurements can be present due to malicious attacks and system malfunction. Since the system of linear equations in compressive sensing is underconstrained, errors introduced by these bad measurements can result in large changes in decoded solutions. We describe methods for identifying bad measurements so that they can be removed before decoding. In a new separation-based method we separate out top nonzero variables by ranking, eliminate the remaining variables from the system of equations, and then solve the reduced overconstrained problem to identify bad measurements. Comparing to prior methods based on direct or joint l1-minimization, the separation-based method can work under a much smaller number of measurements. In analyzing the method we introduce the notion of inversions which governs the separability of large nonzero variables.