Are AI companies committed to a safe future?

Featured in

our mission

We rate frontier AI companies' risk management practices. Our objective is to make frontier AI safer through improved risk management practices and to enhance the accountability of private actors shaping the development of AI.

Our complete methodology can be found here.

Filter by companies:

All
Frontier Companies

Click logos to visit company pages

Risk Management Framework Maturity of AI Companies (as of July 2025):

Click logos to visit company pages

very weak

weak

moderate

substantial

strong

35%

33%

25%

22%

20%

19%

18%

16%

11%

10%

What can be achieved

35%

is currently the best overall rating, given to Anthropic.

53%

is currently the best overall rating if a company adopted all the industry best practices found across companies.

Why we are rating AI companies

Advanced AI systems are arguably one of the most transformative technologies ever built.

Frontier AI companies are advancing technology at an astonishing rate, leading the three most cited AI researchers—Geoffrey Hinton, Ilya Sutskever, and Yoshua Bengio—to believe it will have a profound impact on societies and could even potentially contribute to their downfall.

Yet, we don’t currently know how to properly manage AI risks. Frontier models are quickly becoming more capable, and they already pose potential large-scale risks. For example, they can now significantly help people with basic technical backgrounds create biological weapons. As AI capabilities continue to advance rapidly, the gap between these growing risks and our inadequate risk management frameworks widens. This situation underscores the urgent need for robust risk management practices in the AI industry.

HOW We selected companies

At the AI Seoul Summit, multiple companies pledged to publish safety frameworks by February 2025, detailing how they manage risks when developing and deploying frontier AI models and systems. We have assessed the 12 companies that fulfilled this commitment: Amazon, Anthropic, Cohere, G42, Google DeepMind, Magic, Meta, Microsoft, Naver, NVIDIA, OpenAI, and xAI.

Six companies: 01.AI, Inflection AI, Minimax, Mistral AI, Technology Innovation Institute, and Zhipu AI, have not published their frameworks despite their commitments. Additionally, IBM summarized its existing AI governance practices without addressing severe AI risks, while Samsung published a framework limited to on-device AI safety, neither of which met our criteria for frontier AI risk management evaluation. These approaches fall outside the scope of frontier AI risk management.

Without published frameworks, these companies cannot demonstrate any risk management practices, resulting in the lowest possible assessment, a score of 0%.

Filter:

All
Frontier Companies

Click logos to visit company pages

Overall score

Anthropic

35%

33%

25%

22%

20%

19%

18%

16%

11%

10%

1. Risk Identification

OpenAI

26%

32%

17%

23%

20%

11%

14%

12%

2. Risk Analysis and Evaluation

Risk Management Reporting in four Dimensions

We assess AI companies' frontier safety frameworks, which outline their risk management practices. At a high level, these frameworks should address the following 4 dimensions:

Risk identification

To what extent has the AI company identified potential risks comprehensively?

Risk Analysis and Evaluation

Has the AI company clearly defined acceptable risk levels and established measurable thresholds for both risk and mitigation levels?

Risk treatment

Does the AI company implement adequate mitigation measures and continuously monitor both risk levels and mitigation effectiveness to ensure risks remain below defined tolerances?

Risk governance

Has the AI company established clear organizational structures, accountability mechanisms, and transparency practices to ensure consistent and rigorous risk management?

Conclusion

All companies currently have weak to very weak risk management practices.

If a company were to apply all the best practices currently found across the other companies, they would achieve a score of 53%, a substantial improvement over the current best score of 35%.

As AI systems become capable of causing large-scale physical harm, the current state of risk management is unacceptable. Companies must urgently prioritize improving their practices.

We believe that these ratings and templates will equip companies with the tools to develop more comprehensive and transparent AI risk management procedures and will result in improved safety across the board.

We intend to continually update our ratings as companies improve their policies. Ultimately, we aim for these ratings to be used across the AI deployer and investment community.

Current strengths

Risk identification is partially addressed, with companies typically covering 2-4 risks, though many improperly exclude the risks that they do not fully understand. Companies generally address core risks from domains such as CBRN (Chemical, Biological, Nuclear, Radiological), cyber offense, and in a few cases, automated AI R&D. Some companies, like GoogleDeepMind and OpenAI, show sophistication by breaking down complex loss of control risks into granular categories such as deceptive alignment and autonomous replication.
Some companies recognize the importance of risk modeling and define measurable Key Risk Indicators (KRIs) relatively well. Companies have established KRI thresholds such as capability thresholds, which indicate types of evaluations that will be needed to determine if the capability threshold has been crossed. However, KRIs should in general map to the actual evaluations performed, which is not often described currently in companies’ frameworks.
Some companies have established clear risk governance structures. For instance, companies like G42, OpenAI, Microsoft, and Anthropic have created specialized governance bodies including dedicated risk officers, safety advisory groups, and risk councils that represent meaningful institutionalization of risk management. Nonetheless, these remain fairly high-level and crucial details about final decision authority, oversight mechanisms, and handling of edge cases, are lacking.
Additional areas where companies show promise include early steps towards transparency and inclusion of third parties. Companies like Anthropic, Microsoft, and Cohere have made commitments to share evaluation results with stakeholders, engage third parties in assessments, and report externally on governance structures, representing positive movement toward industry accountability.

Opportunities for improvement

Risk tolerance (the maximum level of risk companies are willing to impose on society) is almost universally undefined, creating a fundamental gap where companies set capability thresholds and select mitigations without explicit bounds to ensure risks remain acceptable. This fundamental weakness undermines the entire risk management process, as companies cannot make principled decisions about acceptable risk levels without explicit tolerance thresholds. Instead, the criteria for these decisions remain discretionary and vague.
Open-ended red teaming, which is necessary to discover unknown, emergent risks, remains largely absent from frameworks, leaving companies unprepared for unforeseen dangers that arise as AI capabilities advance beyond the existing risk categories. Most companies focus narrowly on known risk domains without having systematic processes for identifying novel threats that may emerge as capabilities evolve. Given that AI companies are explicitly aiming to develop novel capabilities in their frontier models, this is crucial.
Most companies do not have a policy to put development on hold if they cannot sufficiently mitigate risk, or their threshold for doing so is vaguely defined, discretionary and non pre-emptive. While companies provide thresholds for when deployment would be paused, the same does not apply for development. This is problematic since many risks arise already during development. This represents a critical failure to prioritize safety over business objectives.
Separation of roles and responsibilities remains limited. A sound risk governance framework calls for distinct and separate entities and roles that can act as checks and balances on each other. The well-known Three Lines model, for example, calls for independent internal audit, clear responsibilities for the Board of Directors and an appropriately resourced central risk function that can challenge management on its risk decisions. None of the AI companies currently have all necessary components in place and only a few have distinct audit or risk teams with any independence.
Most frameworks are reactive rather than proactive with mitigations, failing to develop comprehensive, measurable ex ante risk management plans specifying precisely how they will mitigate risk, and why they believe these measures will be sufficient. To improve, frameworks should include more details on how they will evaluate the quality of their risk mitigation measures and provide proof that such measures will be sufficient before risks materialize. This is a critical gap given that effective mitigation measures require substantial development time and extensive advance planning.
There is a concerning development of ‘marginal risk clauses’, which make deployment decisions contingent on other companies’ risk tolerances. One company to include such a clause is OpenAI. Risk management is predicated on having an ‘unacceptable risk level’ that companies commit not to cross. Importantly then, this level should be independent of other companies’ risk levels. We strongly caution against these clauses, as they go against the spirit of risk management.

FAQ

What do these ratings achieve?

Our main concern is to push for transparency and accountability as AI progresses, and we view these ratings as a good first step in achieving that goal. At this stage there is no private actor we would feel comfortable moving forward and developing AI systems for the next few years without substantial overhaul – our ratings make that clear and incentivize change.

Won't these ratings encourage safety washing?

This is a challenge we’ve given a lot of thought to. Without compliance reviews of their frameworks, commitments may not amount to actual action. However, we believe commitments are the starting point. This is why we give concrete points of how companies can improve their practices for every criterion. Moreover, to avoid the gaming of our ratings, we may update the scale over time as industry practices mature.

Won’t SaferAI's risk management framework prevent us from reaping the benefits of AI?

This risk management framework is designed to favor transparency from AI companies. It is designed to ensure that democratically chosen risk preferences are respected through AI development & deployment, accounting for the benefits. As such, it should incentivize AI companies to develop technologies that have the highest chance of delivering AI benefits safely. Moreover, greater transparency from foundation model providers simplifies downstream deployment, accelerating adoption across the ecosystem.

Do you want corporate governance to substitute for regulation?

We want AI companies to improve their transparency and expect that to be complementary to existing regulations and to favor the development of future adequate regulations, providing well-needed data to encourage reasonable trade-offs when designing future rules.

What does a 100% rating mean?

A 100% rating indicates that a company has implemented risk management practices we consider strong and sufficient to manage the risks posed by advanced AI systems. This represents comprehensive implementation across all four dimensions of our framework. Companies achieving this score demonstrate mature, well-documented processes with robust oversight and accountability mechanisms. For a detailed explanation of each component and its importance, please refer to our risk management framework.

Why do you only look at frontier safety frameworks?

First, while some companies do include relevant risk management information in their research publications or model cards, focusing exclusively on frontier safety frameworks enables a systematic, apples-to-apples comparison across all companies. Companies vary significantly in their publication volume, and publications serve different purposes, which would create assessment imbalances if we looked at all documents.

Second, frontier safety frameworks represent formal, ongoing commitments to risk management practices, whereas research papers and model cards often demonstrate point-in-time implementations. By concentrating on these official frameworks, we ensure our ratings reflect companies' sustained approaches to AI safety rather than isolated examples of good practice.