An AI System Evaluation Framework for Advancing AI Safety: Terminology, Taxonomy, Lifecycle Mapping

Xia, Boming; Lu, Qinghua; Zhu, Liming; Xing, Zhenchang

Date

2024-07-10

Authors

Xia, Boming

Lu, Qinghua

Zhu, Liming

Xing, Zhenchang

Publisher

Association for Computing Machinery (ACM)

Abstract

The advent of advanced AI underscores the urgent need for comprehensive safety evaluations, necessitating collaboration across communities (i.e., AI, software engineering, and governance). However, divergent practices and terminologies across these communities, combined with the complexity of AI systems, of which models are only one part, and environmental affordances (e.g., access to tools), obstruct effective communication and comprehensive evaluation. This paper proposes a framework for AI system evaluation comprising three components: 1) harmonised terminology to facilitate communication across communities involved in AI safety evaluation; 2) a taxonomy identifying essential elements for AI system evaluation; 3) a mapping between the AI lifecycle, stakeholders, and requisite evaluations for an accountable AI supply chain. This framework catalyses a deeper discourse on AI system evaluation beyond model-centric approaches.

Keywords

AI Safety, AI Testing, Benchmarking, Evaluation, Responsible AI

URI

http://www.scopus.com/inward/record.url?scp=85199903661&partnerID=8YFLogxK
https://hdl.handle.net/1885/733751677

Collections

ANU Research Publications

Type

Conference paper

Book Title

AIware 2024 - Proceedings of the 1st ACM International Conference on AI-Powered Software, Co-located with: ESEC/FSE 2024

Entity type

Publication

DOI

10.1145/3664646.3664766

Downloads

File

3664646.3664766.pdf (485.36 KB)
