In this paper, the Kyle model of insider trading is extended by characterizing the trading volume with long memory and allowing the noise trading volatility to follow a general stochastic process. Under this newly revised model, the equilibrium conditions are determined, with which the optimal insider trading strategy, price impact and price volatility are obtained explicitly. The volatility of the price volatility appears excessive, which is a result of the fact that a more aggressive trading strategy is chosen by the insider when uninformed volume is higher. The optimal trading strategy turns out to possess the property of long memory, and the price impact is also affected by the fractional noise. constant since new private information is introduced every period caused by the deterministic changes in the noise trading volatility. Stochastic variation was also introduced in the noise trading volume by Foster and Viswanathan [15], and this model was then empirically tested to show the joint behavior of the price volatility, volume, and price impact. However, a noticeable shortcoming of these extensions is the assumption that the information is short-lived, which is not realistic. Later on, deterministic noise trading volatility was directly incorporated into the Kyle model so that intra-day patterns of liquidity trading can be successfully captured [4]. Such an assumption on the noise trading volatility is still not appropriate since it gives rise to the deterministic price impact and further leads to the situation where expected execution costs of liquidity traders do not depend on the timing of their trading. A more recent extension to the Kyle model was proposed by Collin-Dufresne and Fos [11], who considered stochastic noise trading volatility, and showed that this is the only guarantee that insider's liquidity timing option can generate a stochastic price volatility as well as a nonzero correlation among the price volatility, market depth, and uninformed trading volume.Despite all the advantages of the model proposed in [11], it has not taken into consideration that the volatility is characterized by long memory, which is a consensus among financial econometricians. In particular, Bollerslev and Jubinski [7] presented the evidence of long memory in the volatility and investigated the extent to which the volume and volatility share common long-run dependencies. In line with this, they also proposed that the mixture-of-distributions hypotheses (MDH) might be better viewed as long-run proposition, after observing that occasional structural breaks can give rise to long-range dependence [17]. This implies long-range dependence could be induced under the MDH by occasional changes in the mean and/or volatility of the intra-day volumes and returns, which are generated due to the reaction traders make to information events. Furthermore, fractionally-integrated models were shown to provide a useful description of volatility dynamics in the presence of structural breaks because they effectively allow the unconditional ...