Updated on Feb 6th 2024 based on the version endorsed by the Coreper I on Feb 2nd
Section 1: Information to be provided by all providers of general-purpose AI models
The technical documentation referred to in Article X (b) shall contain at least the following information as appropriate to the size and risk profile of the model:
1. A general description of the general purpose AI model including:
- the tasks that the model is intended to perform and the type and nature of AI systems in which it can be integrated;
- acceptable use policies applicable;
- the date of release and methods of distribution;
- the architecture and number of parameters;
- modality (e.g. text, image) and format of inputs and outputs;
- the license.
2. A detailed description of the elements of the model referred to in paragraph 1, and relevant information of the process for the development, including the
- the technical means (e.g. instructions of use, infrastructure, tools) required for the general-purpose AI model to be integrated in AI systems;
- the design specifications of the model and training process, including training methodologies and techniques, the key design choices including the rationale and assumptions made; what the model is designed to optimise for and the relevance of the different parameters, as applicable;
- information on the data used for training, testing and validation, where applicable, including type and provenance of data and curation methodologies (e.g. cleaning, filtering etc), the number of data points, their scope and main characteristics; how the data was obtained and selected as well as all other measures to detect the unsuitability of data sources and methods to detect identifiable biases, where applicable;
- the computational resources used to train the model (e.g. number of floating point operations – FLOPs), training time, and other relevant details related to the training;
- known or estimated energy consumption of the model; in case not known, this could be based on information about computational resources used ;
Section 2: Additional information to be provided by providers of general purpose AI model with systemic risk
3. Detailed description of the evaluation strategies, including evaluation results, on the basis of available public evaluation protocols and tools or otherwise of other evaluation methodologies. Evaluation strategies shall include evaluation criteria, metrics and the methodology on the identification of limitations.
4. Where applicable, detailed description of the measures put in place for the purpose of conducting internal and/or external adversarial testing (e.g. red teaming), model adaptations, including alignment and fine-tuning.
5. Where applicable, detailed description of the system architecture explaining how software components build or feed into each other and integrate into the overall processing.