“…Simply put, we first trained a CNN architecture including two hidden layers and an output layer to decouple the light-field image (x,y,θ, ϕ) and interference from a raw image. Then we reconstructed the depth map (x,y,z) from the light-field image using a disparity estimation algorithm based on scale-depth space transform [27], while a 3D spectral datacube (x,y,λ) of the subject was derived from the interference by Fourier transform [28], [29]. We thoroughly described the theoretical model and reconstruction algorithm of the complete plenoptic imaging in Ref [25].…”