The growing interest in artificial intelligence (AI) and Large Language Models (LLMs) in particular, such as Generative Pre-trained Transformers (GPT) and Bidirectional Encoder Representations from Transformers (BERT) (Devlin et al., 2018;Brown et al., 2020;Ouyang et al., 2022), have been gaining more and more interest in both academia and with the public at large. While the results from such LLMs might seem plausible, they can also be questionable or totally incorrect as discussed in the past editorial by Batty (2023) who suggested that "we need to combine AI with our own knowledge and intuitions." Similar notions have been suggested elsewhere by Fu et al. ( 2023) who compared ChatGPT from OpenAI with manual evaluations of climate change plans and noticed that the machine-generated evaluations struggled with planning-specific jargon.Nonetheless, over the last year, LMMs have started to be introduced and utilized in GIScience. This is especially the case as newer versions of ChatGPT have been released with more advanced capabilities, including more robust reasoning, longer context windows and multimodal functionalities (e.g., text and images). These improvements have enabled users using natural language to interact with the system for diverse tasks, ranging from translation, classification, image, and code generation (e.g., GitHub's Copilot) to information retrieval, and mapping. For example, Hu et al. ( 2023) combined geographical knowledge with GPT models to recognize geographical location descriptions in social media posts for supporting disaster response and management while Jang et al. ( 2023) used ChatGPT and DALL.E3 (a text-to-image generation model) to study place identity and found that both can capture the salient features of the city of interest. Others like Tao and Xu (2023) examined the capability of ChatGPT in different map making tasks (e.g., thematic and mental maps) based on either publicly available geographical data or conversation-based textual descriptions of geographic space. Similarly, Li and Ning (2023) introduced an autonomous GIS prototype by leveraging LLMs for tasks like geographical data collection, analysis, and visualization through natural language prompts. Some are calling this GeoQA (Geographic Question Answering, Feng et al., 2023) whereby researchers utilize LLMs to answer geographic questions in natural language. These studies, and others alike, align with the notion of "(n)ext generation of GIS: must be easy (Zhu et al., 2021)," emphasizing the shift towards more accessible and user-friendly GIS and spatial analysis technologies.What does this mean for urban analytics? Again, quoting from a past editorial by Batty (2019) the "term analytics implies a set of methods that can be used to explore, understand and predict properties and features of any system, in our case of cities." One example is the use of street view images to extract and understand the properties of the system or the space within a city. In the past, training and segmentation of such data was a time consumin...