Neuronpedia

APIAssistant AxisNEW Circuit TracerNEW Steer SAE Evals Exports Community Blog Privacy & Terms Contact

INDEX

Explanations

ham and prosciutto

np_acts-logits-general · gemini-2.5-flash-lite

New Auto-Interp

Configuration

google/gemma-scope-2-27b-it/resid_post/layer_16_width_262k_l0_medium

Prompts (Dashboard)

238,145 prompts, 512 tokens each

Dataset (Dashboard)

lmsys + oasst1

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

0.63

0.56

ær

0.53

0.51

[\

0.50

大

0.50

Mits

0.49

css

0.49

जि

0.48

지는

0.48

POSITIVE LOGITS

ק

0.61

एडा

0.57

 viande

0.55

 keuken

0.55

 tien

0.55

IERC

0.55

sciutto

0.54

ό

0.54

 meats

0.53

 Sorrento

0.53

Activations Density 0.001%

No Known Activations

© Neuronpedia 2025

Privacy & Terms Blog GitHub Slack Twitter Contact