
OpenAI

Gemma Scope 2: Helping the AI Safety Community Deepen Understanding of Complex Language Model Behavior

Announcing a new, open suite of tools for language model interpretability.

Large Language Models (LLMs) are capable of incredible feats of reasoning, yet their internal decision-making processes remain largely opaque. Should a system not behave as expected, a lack of visibility into its internal workings can make it difficult to pinpoint the exact reason for…


Deepening AI Safety Research with UK AI Security Institute (AISI)

Today, we're announcing an expanded partnership with the UK AI Security Institute (AISI) through a new Memorandum of Understanding focused on foundational security and safety research, to help ensure artificial intelligence is developed safely and benefits everyone. The research partnership with AISI is an important part of our broader collaboration with the UK government on…
