With the widespread adoption of AI models for important decision making, ensuring the reliability of such models remains an important challenge. In this paper, we present a generic end-to-end framework for testing AI models that performs automated test generation across modalities such as text, tabular, and time-series data, and across properties such as accuracy, fairness, and robustness. Our tool has been used to test industrial AI models and was very effective at uncovering issues in those models. Demo video link: https://youtu.be/984UCU17YZI