Enabling scalability-sensitive speculative parallelization for FSM computations

Junqiao Qiu, Zhijia Zhao, Bo Wu, Abhinav Vishnu, Shuaiwen Leon Song

Research output: Chapters, Conference Papers, Creative and Literary WorksRGC 32 - Refereed conference paper (with host publication)peer-review

17 Citations (Scopus)

Abstract

Finite state machines (FSMs) are the backbone of many applications, but are difficult to parallelize due to their inherent dependencies. Speculative FSM parallelization has shown promise on multicore machines with up to eight cores. However, as hardware parallelism grows (e.g., Xeon Phi has up to 288 logical cores), a fundamental question raises: How does the speculative FSM parallelization scale as the number of cores increases? Without answering this question, existing methods for speculative FSM parallelization simply choose to use all available cores, which might not only waste computing resources, but also result in suboptimal performance. In this work, we conduct a systematic scalability analysis for speculative FSM parallelization. Unlike many other parallelizations which can be modeled by the classic Amdahl's law or its simple extensions, speculative FSM parallelization scales unconventionally due to the non-deterministic nature of speculation and the cost variations of misspeculation. To address these challenges, this work introduces a spectrum of scalability models that are customized to the properties of specific FSMs and the underlying architecture. The models, for the first time, precisely capture the scalability of speculative parallelization for different FSM computations, and clearly show the existence of a "sweet spot" in terms of the number of cores employed by the speculative FSM parallelization to achieve the optimal performance. To make the scalability models practical, we develop S3, a scalability-sensitive speculation framework for FSM parallelization. For any given FSM, S3 can automatically characterize its properties and analyze its scalability, hence guide speculative parallelization towards the optimal performance and more efficient use of computing resources. Evaluations on different FSMs and architectures confirm the accuracy of the proposed models and show that S3 achieves significant speedup (up to 5X) and energy savings (up to 77%).
Original languageEnglish
Title of host publicationICS 2017: International Conference on Supercomputing
PublisherAssociation for Computing Machinery
VolumePart F128411
ISBN (Print)9781450350204
DOIs
Publication statusPublished - 14 Jun 2017
Externally publishedYes
Event31st ACM International Conference on Supercomputing, ICS 2017 - Chicago, United States
Duration: 13 Jun 201716 Jun 2017
http://www.ics-conference.org

Publication series

NameProceedings of the International Conference on Supercomputing
VolumePart F128411

Conference

Conference31st ACM International Conference on Supercomputing, ICS 2017
PlaceUnited States
CityChicago
Period13/06/1716/06/17
Internet address

Bibliographical note

Publication details (e.g. title, author(s), publication statuses and dates) are captured on an “AS IS” and “AS AVAILABLE” basis at the time of record harvesting from the data source. Suggestions for further amendments or supplementary information can be sent to [email protected].

Research Keywords

  • Finite State Machine
  • FSM
  • Parallelization
  • Scalability
  • Speculation

Fingerprint

Dive into the research topics of 'Enabling scalability-sensitive speculative parallelization for FSM computations'. Together they form a unique fingerprint.

Cite this