Dennard scaling rule, which was proposed in 1974 and had been relevant for decades due to advances in the development of metal-oxide-semiconductor field-effect transistors (MOSFETs), has finally ran out of steam. [1,2] However, the need for energyefficient computing nowadays is even more urgent than ever before, covering numerous domains, such as large-scale sensor networks, Internet of Things (IoT), bioelectronics, and arguably, most importantly, neuromorphic computing applications. [3-18] While revisiting the reasons behind the failure of Dennard scaling rule, the following two major issues with the modern approach may be raised. First, the "Boltzmann limit," i.e., the statistical distribution of electrons in semiconductor energy bands sets the minimum ON/OFF switching for conventional transistors. The second is the "von Neumann bottleneck"; the performance of memory significantly lags behind that of the processor, which, in turn, leads to latency in conventional computer architectures. When higher performance is expected from the existing computer infrastructure, particularly, to simulate the brain architecture in neuromorphic computers, one would 1) sacrifice the leakage of transistors, e.g., by lowering the threshold voltage of a MOSFET, and/or 2) enlarge the size of memories, such as static random-access memories (SRAMs) and embedded dynamic random-access memories (DRAMs). However, both of the approaches result in an explosive use of chip energy consumption. With the drastic increase in device counts to meet the modern systems' requirements, it is critical to consider the development of a new building block with a fundamentally different mechanism as well as delivering a unified computingstorage functionality. [3,6,19,20] Spintronic devices have many unparalleled advantages, including data non-volatility, low operational power, radiation hardness, potential of scaling down to the sub-5 nm cell size, and natural inclination for multilevel 3D computing paradigms. [21-24] A primitive device structure for spintronics is the magnetic tunnel junction (MTJ) device, which consists of two ferromagnetic (FM) layers and a thin insulating layer (e.g., MgO) used as the tunneling barrier between the two FMs layers. Depending on the parallel (P) or anti-parallel (AP) relative orientation of the two FMs' magnetizations, the tunnel magnetoresistance (TMR) of an MTJ can be used as the read-out mechanism. As for the write mechanism, the spin-transfer torque (STT) carried by the injection current can be utilized to switch one of the FMs' magnetization, in analogy with the write process in a modern magnetization random-access memory (MRAM). [25] Recently, significant progress has been achieved to use separate paths for write and read operations in MRAMs, for example, by inducing spin-orbit torque (SOT) or voltage-controlled magnetization anisotropy (VCMA) effects, to further lower the operation energy and improve the write endurance. [26,27]