Automation of shoe metrology is crucial to providing fit information for shoes on a large scale. Here, we examine a segmentation technique to extract the inner shoe volume (ISV) from Computed Tomography (CT) data-the proposed approach leverages artificial neural networks to extract shoe parts for automated metrology precisely. The neural network architecture is customized to facilitate the extraction of ISV by integrating spatial attention mechanisms. Furthermore, a neural network segmentation algorithm removes filler materials virtually. This process yields enhancements of 1.3% in F1-score through material removal and an additional 1.4% through the incorporation of spatial attention. Notably, spatial attention mechanisms yield improved outcomes at the aperture of the shoe. The elimination of filler materials reduces false positive segmentations. The segmentation outcomes are utilized to generate surface meshes. These are compared to surface meshes derived from annotated data. We measure an average Hausdorff distance between annotated and labelled data of 2.1 mm. The discrepancy is primarily attributed to deformations and artifacts. On both sets, we measure the effective shoe length. Precision and accuracy metrics for the extracted measurement from ANN-segmented data attain 0.8 mm and 1.8 mm, respectively. For meshes obtained from label data, the precision is 0.2 mm, and the accuracy is 2.5 mm. Our findings underscore the accuracy of the extracted shoe interior volumes, rendering them suitable for metrological applications. Limitations include unsolved issues with separation reliability and deformation.