Risk Identification

In risk identification, we assess whether an AI developer is:

  • Appropriately addressing the risks outlined in the literature.
  • Conducting extensive open-ended red teaming to identify new risks.
  • Leveraging a diverse range of risk identification techniques, including threat modeling when appropriate, to adequately identify new threats.
Risk Identification
  • Mistral analyzes bias in their first mixture-of-experts model, 'Mixtral of Experts', using the BOLD and BBQ benchmarks, comparing results to Llama 2.


  • Mistral has not discussed bias or any other risks in releases following 'Mixtral of Experts' (December 11, 2023).
  • Mistral provides no evidence of open-ended red teaming, threat modeling, or other risk identification techniques.
  • The only mitigation measure discussed by Mistral demonstrates a lack of threat and risk modeling. For their first model, 'Mistral 7B', they introduced a system prompt to reduce harmful outputs. However, this approach is ineffective against misuse, as malicious actors can simply omit the prompt.
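The limitation of a system-prompt safeguard can be illustrated with a minimal sketch. It assumes an openly available model that the user runs locally, so the prompt text is assembled by the caller. The `GUARDRAIL` string, the `build_prompt` helper, and the `[SYSTEM]`/`[USER]` template are hypothetical and used only for illustration; they are not Mistral's actual prompt or chat format.

```python
from typing import Optional

# Hypothetical guardrail text, for illustration only (not Mistral's actual prompt).
GUARDRAIL = "Always answer safely and refuse harmful requests."


def build_prompt(user_message: str, system_prompt: Optional[str] = None) -> str:
    """Assemble the raw text sent to a locally run model.

    The template is a simplified assumption. The point is that the caller,
    not the model weights, decides whether the system prompt is included.
    """
    parts = []
    if system_prompt:
        parts.append(f"[SYSTEM] {system_prompt}")
    parts.append(f"[USER] {user_message}")
    return "\n".join(parts)


# A cooperative user includes the guardrail.
guarded = build_prompt("example request", system_prompt=GUARDRAIL)

# Any other user can leave it out; nothing in the model enforces it.
unguarded = build_prompt("example request")
```

Because the safeguard lives entirely in caller-supplied input, it constrains only users who choose to apply it.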
Risk Tolerance & Analysis

In risk tolerance and analysis, we assess whether the AI developers have defined:

  • A global risk tolerance.
  • Operational capability thresholds and their equivalent risk. These must be defined with precision and breadth.
  • Corresponding objectives of risk mitigation measures: AI developers should establish clear objectives for risk mitigation measures. These objectives should be grounded in strong rationales, including threat modeling, to justify that they are sufficient to address the identified risks and align with the organization's risk tolerance.
  • Evaluation protocols detailing procedures for measuring the model's capabilities and ensuring that capability thresholds are not exceeded without detection.
Global Risk Tolerance
  • Mistral does not state any global risk tolerance, even qualitatively.
Operational Risk Tolerance
  • Mistral provides no information on capability thresholds, mitigation objectives, or assurance properties.

Evaluation Protocols


  • Mistral provides no information about evaluation protocols for dangerous capabilities.
Risk Mitigation

In risk mitigation, we assess whether:

  • The proposed risk mitigation measures, which include both deployment and containment strategies, are well-planned and clearly specified.
  • There is a strong case that the assurance properties actually reduce risks, and the assumptions under which these properties operate are clearly stated.
Containment Measures
  • Mistral describes no containment measures.
Deployment Measures
  • Mistral describes no threat model–relevant deployment measures.
Assurance Properties


  • Mistral provides no information regarding the pursuit of assurance properties.
Best-in-class: These are elements where the company outperforms all the others. They represent industry-leading practices.
Highlights: These are the company's strongest points within the category, justifying its current grade.
Weaknesses: These are the areas that prevent the company from achieving a higher score.


The main source of information available for Mistral AI is the ‘news’ page on their website.