2024
DOI: 10.1145/3639372

Explainability for Large Language Models: A Survey

Haiyan Zhao,
Hanjie Chen,
Fan Yang
et al.

Abstract: Large language models (LLMs) have demonstrated impressive capabilities in natural language processing. However, their internal mechanisms are still unclear, and this lack of transparency poses unwanted risks for downstream applications. Therefore, understanding and explaining these models is crucial for elucidating their behaviors, limitations, and social impacts. In this paper, we introduce a taxonomy of explainability techniques and provide a structured overview of methods for explaining Transformer-based language models…

Cited by 38 publications (7 citation statements)
References 101 publications
“…Moreover, LLMs are often seen as "black boxes" due to their complex and opaque nature, making it difficult to understand how they process data and arrive at specific outputs [63]. This lack of transparency can hinder the identification and rectification of privacy and security issues within the model.…”
Section: Limitations
Citation type: mentioning (confidence: 99%)
“…Ensuring sufficient interpretability can help AI research scientists and developers to debug the models they are building and to uncover otherwise hidden or unforeseeable failure modes, thereby improving downstream model functioning and performance (Bastings et al., 2022; Luo & Specia, 2024). It can also help detect and mitigate discriminatory biases that may be buried within model architectures (Alikhademi et al., 2021; Zhao, Chen, et al., 2024; Zhou et al., 2020). Furnishing understandable and accessible explanations of the rationale behind system outputs can likewise help to establish the lawfulness of AI systems (e.g., their compliance with data protection law and equality law) (Chuang et al., 2024; ICO/Turing, 2020), as well as to ensure responsible and trustworthy implementation by system deployers, who are better equipped to grasp system capabilities, limitations, and flaws and to integrate system outputs into their own reasoning, judgment, and experience (ICO/Turing, 2020; Leslie, Rincón, et al., 2024).…”
Section: Risks From Model Scaling: Model Opacity and Complexity
Citation type: mentioning (confidence: 99%)
“…While the field of explainable AI (often referred to simply as XAI) has made notable progress over the past several years in advancing knowledge about the behaviors and potential flaws of opaque AI systems (Angelov et al., 2021; Räuker et al., 2023; Zhao, Chen, et al., 2024), myriad critical voices have emphasized that applications of contemporary AI explainability methods to black-box AI systems are rife with shortcomings that continue to hamper their real-world utility. These critics have cautioned against 'false hopes' that current explainability techniques provide justified reassurance about the safety, accuracy, reliability, and fairness of black-box models, stressing that contemporary approaches often generate misleading or unfaithful explanations (Ghassemi et al., 2021, p. e746).…”
Section: Risks From Model Scaling: Model Opacity and Complexity
Citation type: mentioning (confidence: 99%)