Neuronpedia

APIAssistant AxisNEW Circuit TracerNEW Steer SAE Evals Exports Community Blog Privacy & Terms Contact

INDEX

Explanations

price

np_max-act · gemini-2.0-flash

New Auto-Interp

Configuration

andyrdt/saes-gpt-oss-20b/resid_post_layer_11/trainer_0

Dataset (Dashboard)

Various

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

 budgets

-0.08

 बज

-0.08

pygame

-0.08

 cuja

-0.08

contador

-0.08

Publicidade

-0.08

 प्रण

-0.08

وعة

-0.08

 orçamento

-0.08

علانات

-0.07

POSITIVE LOGITS

确定

0.08

祖

0.08

 créer

0.08

 عليهم

0.07

ikol

0.07

 outsiders

0.07

 tant

0.07

 favored

0.07

 bleu

0.07

 grandfather

0.07

Activations Density 0.009%

No Known Activations

© Neuronpedia 2025

Privacy & Terms Blog GitHub Slack Twitter Contact