2019
DOI: 10.1145/3301488
|View full text |Cite
|
Sign up to set email alerts
|

Exploiting SIMD Asymmetry in ARM-to-x86 Dynamic Binary Translation

Abstract: Single instruction multiple data (SIMD) has been adopted for decades because of its superior performance and power efficiency. The SIMD capability (i.e., width, number of registers, and advanced instructions) has diverged rapidly on different SIMD instruction-set architectures (ISAs). Therefore, migrating existing applications to another host ISA that has fewer but longer SIMD registers and more advanced instructions raises the issues of asymmetric SIMD capability. To date, this issue has been overlooked and t… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
6
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
3
1
1

Relationship

0
5

Authors

Journals

citations
Cited by 5 publications
(6 citation statements)
references
References 39 publications
0
6
0
Order By: Relevance
“…This dynamic technique enables short-SIMD binaries portability across newer, wider SIMD generations [151]. Spill-aware superword level parallelism (saSLP) [152] exploits the x86 AVX2 host's parallelism, gathers instructions, and registers capacity. To support that, it combines short ARMv8 instructions and registers in the guest binaries.…”
Section: ) Instruction-level Approachmentioning
confidence: 99%
“…This dynamic technique enables short-SIMD binaries portability across newer, wider SIMD generations [151]. Spill-aware superword level parallelism (saSLP) [152] exploits the x86 AVX2 host's parallelism, gathers instructions, and registers capacity. To support that, it combines short ARMv8 instructions and registers in the guest binaries.…”
Section: ) Instruction-level Approachmentioning
confidence: 99%
“…Vectorization methods for loops include SLP-oriented loop unrolling optimization [33][34][35], selection optimization of vector methods based on program parallelism features [36], and vector recognition optimization based on directed graph reachability [37]. In addition, SLP methods have also been applied in the fields of dynamic code conversion [38] and optimization of vector code in inline assembly form [39], etc.…”
Section: Wireless Communications and Mobile Computingmentioning
confidence: 99%
“…Figure 4 shows the workflow for binary lifting, including the three different IRs used throughout this process, namely, MCInst, MachineInstr, and finally the LLVM IR. By lifting the binary to the LLVM IR, we are able to re-optimize the program, enabling us to exploit features that are specific of the target ISA [40,46] or focus on a different objective function such as code-size reduction [20, 57ś60]. First, the source binary is disassembled to an array of MCInst, which is the lowest-level IR in LLVM, working as an in-memory representation of the disassembled binary code.…”
Section: Binary Liftingmentioning
confidence: 99%