Neuronpedia

APIAssistant AxisNEW Circuit TracerNEW Steer SAE Evals Exports Community Blog Privacy & Terms Contact

INDEX

Explanations

assessing management experience

np_acts-logits-general · gemini-2.5-flash-lite

New Auto-Interp

Configuration

google/gemma-scope-2-27b-it/resid_post/layer_16_width_262k_l0_medium

Prompts (Dashboard)

238,145 prompts, 512 tokens each

Dataset (Dashboard)

lmsys + oasst1

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

ifdef

0.68

jols

0.64

jon

0.63

ج

0.63

 الفرنس

0.61

Uw

0.59

ീ

0.57

ossus

0.57

 المعروف

0.57

ißler

0.56

POSITIVE LOGITS

רה

0.69

কে

0.65

ל

0.65

 time

0.65

 layer

0.64

..

0.63

ﻚ

0.62

on

0.61

ยาน

0.61

0.60

Activations Density 0.000%

No Known Activations

© Neuronpedia 2026

Privacy & Terms Blog GitHub Slack Twitter Contact