Feedforward visual object perception recruits a cortical network that is assumed to be hierarchical, progressing from basic visual features to complete object representations. However, the nature of the intermediate features related to this transformation remains poorly understood. Here, we explore how well different computer vision recognition models account for neural object encoding across the human cortical visual pathway as measured using fMRI. These neural data, collected during the viewing of 60 images of real-world objects, were analyzed with a searchlight procedure as in Kriegeskorte, Goebel, and Bandettini (2006): Within each searchlight sphere, the obtained patterns of neural activity for all 60 objects were compared to model responses for each computer recognition algorithm using representational dissimilarity analysis (Kriegeskorte et al., 2008). Although each of the computer vision methods significantly accounted for some of the neural data, among the different models, the scale invariant feature transform (Lowe, 2004), encoding local visual properties gathered from "interest points," was best able to accurately and consistently account for stimulus representations within the ventral pathway. More generally, when present, significance was observed in regions of the ventral-temporal cortex associated with intermediate-level object perception. Differences in model effectiveness and the neural location of significant matches may be attributable to the fact that each model implements a different featural basis for representing objects (e.g., more holistic or more parts-based). Overall, we conclude that well-known computer vision recognition systems may serve as viable proxies for theories of intermediate visual object representation.
Skilled cardiologists perform cardiac auscultation, acquiring and interpreting heart sounds, by implicitly carrying out a sequence of steps. These include discarding clinically irrelevant beats, selectively tuning in to particular frequencies and aggregating information across time to make a diagnosis. In this paper, we formalize a series of analytical stages for processing heart sounds, propose algorithms to enable computers to approximate these steps, and investigate the effectiveness of each step in extracting relevant information from actual patient data. Through such reasoning, we provide insight into the relative difficulty of the various tasks involved in the accurate interpretation of heart sounds. We also evaluate the contribution of each analytical stage in the overall assessment of patients. We expect our framework and associated software to be useful to educators wanting to teach cardiac auscultation, and to primary care physicians, who can benefit from presentation tools for computer-assisted diagnosis of cardiac disorders. Researchers may also employ the comprehensive processing provided by our framework to develop more powerful, fully automated auscultation applications.
The mid- and high-level visual properties supporting object perception in the ventral visual pathway are poorly understood. In the absence of well-specified theory, many groups have adopted a data-driven approach in which they progressively interrogate neural units to establish each unit's selectivity. Such methods are challenging in that they require search through a wide space of feature models and stimuli using a limited number of samples. To more rapidly identify higher-level features underlying human cortical object perception, we implemented a novel functional magnetic resonance imaging method in which visual stimuli are selected in real-time based on BOLD responses to recently shown stimuli. This work was inspired by earlier primate physiology work, in which neural selectivity for mid-level features in IT was characterized using a simple parametric approach (Hung et al., 2012). To extend such work to human neuroimaging, we used natural and synthetic object stimuli embedded in feature spaces constructed on the basis of the complex visual properties of the objects themselves. During fMRI scanning, we employed a real-time search method to control continuous stimulus selection within each image space. This search was designed to maximize neural responses across a pre-determined 1 cm3 brain region within ventral cortex. To assess the value of this method for understanding object encoding, we examined both the behavior of the method itself and the complex visual properties the method identified as reliably activating selected brain regions. We observed: (1) Regions selective for both holistic and component object features and for a variety of surface properties; (2) Object stimulus pairs near one another in feature space that produce responses at the opposite extremes of the measured activity range. Together, these results suggest that real-time fMRI methods may yield more widely informative measures of selectivity within the broad classes of visual features associated with cortical object representation.
Background: The prevalence of unreported concussions is high, and undiagnosed concussions can lead to worse postconcussion outcomes. It is not clear how those with a history of undiagnosed concussion perform on subsequent standard concussion baseline assessments. Purpose: To determine if previous concussion diagnosis status was associated with outcomes on the standard baseline concussion assessment battery. Study Design: Cross-sectional study; Level of evidence, 3. Methods: Concussion Assessment, Research, and Education (CARE) Consortium participants (N = 29,934) self-reported concussion history with diagnosis status and completed standard baseline concussion assessments, including assessments for symptoms, mental status, balance, and neurocognition. Multiple linear regression models were used to estimate mean differences and 95% CIs among concussion history groups (no concussion history [n = 23,037; 77.0%], all previous concussions diagnosed [n = 5315; 17.8%], ≥1 previous concussions undiagnosed [n = 1582; 5.3%]) at baseline for all outcomes except symptom severity and Brief Symptom Inventory–18 (BSI-18) score, in which negative binomial models were used to calculate incidence rate ratios (IRRs). All models were adjusted for sex, race, ethnicity, sport contact level, and concussion count. Mean differences with 95% CIs excluding 0.00 and at least a small effect size (≥0.20), and those IRRs with 95% CIs excluding 1.00 and at least a small association (IRR, ≥1.10) were considered significant. Results: The ≥1 previous concussions undiagnosed group reported significantly greater symptom severity scores (IRR, ≥1.38) and BSI-18 (IRR, ≥1.31) scores relative to the no concussion history and all previous concussions diagnosed groups. The ≥1 previous concussions undiagnosed group performed significantly worse on 6 neurocognitive assessments while performing better on only 2 compared with the no concussion history and all previous concussions diagnosed groups. There were no between-group differences on mental status or balance assessments. Conclusion: An undiagnosed concussion history was associated with worse clinical indicators at future baseline assessments. Individuals reporting ≥1 previous undiagnosed concussions exhibited worse baseline clinical indicators. This may suggest that concussion-related harm may be exacerbated when injuries are not diagnosed.
Contextual associations facilitate object recognition in human vision. However, the role of context in artificial vision remains elusive as does the characteristics that humans use to define context. We investigated whether contextually related objects (bicycle-helmet) are represented more similarly in convolutional neural networks (CNNs) used for image understanding than unrelated objects (bicycle-fork). Stimuli were of objects against a white background and consisted of a diverse set of contexts (N=73). CNN representations of contextually related objects were more similar to one another than to unrelated objects across all CNN layers. Critically, the similarity found in CNNs correlated with human behavior across three experiments assessing contextual relatedness, emerging significant only in the later layers. The results demonstrate that context is inherently represented in CNNs as a result of object recognition training, and that the representation in the later layers of the network tap into the contextual regularities that predict human behavior.
No abstract
Subconcussive head injuries are connected to both short-term cognitive changes and long-term neurodegeneration. Further study is required to understand what types of subconcussive impacts might prove detrimental to cognition. We studied cadets at the US Air Force Academy engaged in boxing and physical development, measuring head impact motions during exercise with accelerometers. These head impact measures were compared with post-exercise memory performance. Investigators explored multiple techniques for characterizing the magnitude of head impacts. Boxers received more head impacts and achieved lower performance in post-exercise memory than non-boxers. For several measures of impact motion, impact intensity appeared to set an upper bound on post-exercise memory performance – stronger impacts led to lower expected memory performance. This trend was most significant when impact intensity was measured through a novel technique, applying principal component analysis to boxer motion. Principal component analysis measures also captured more distinct impact information than seven traditional impact measures also tested.
The properties utilized by visual object perception in the mid- and high-level ventral visual pathway are poorly understood. To better establish and explore possible models of these properties, we adopt a data-driven approach in which we repeatedly interrogate neural units using functional Magnetic Resonance Imaging (fMRI) to establish each unit’s image selectivity. This approach to imaging necessitates a search through a broad space of stimulus properties using a limited number of samples. To more quickly identify the complex visual features underlying human cortical object perception, we implemented a new functional magnetic resonance imaging protocol in which visual stimuli are selected in real-time based on BOLD responses to recently shown images. Two variations of this protocol were developed, one relying on natural object stimuli and a second based on synthetic object stimuli, both embedded in feature spaces based on the complex visual properties of the objects. During fMRI scanning, we continuously controlled stimulus selection in the context of a real-time search through these image spaces in order to maximize neural responses across predetermined 1 cm3 brain regions. Elsewhere we have reported the patterns of cortical selectivity revealed by this approach (Leeds 2014). In contrast, here our objective is to present more detailed methods and explore the technical and biological factors influencing the behavior of our real-time stimulus search. We observe that: 1) Searches converged more reliably when exploring a more precisely parameterized space of synthetic objects; 2) Real-time estimation of cortical responses to stimuli are reasonably consistent; 3) Search behavior was acceptably robust to delays in stimulus displays and subject motion effects. Overall, our results indicate that real-time fMRI methods may provide a valuable platform for continuing study of localized neural selectivity, both for visual object representation and beyond.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.