INDEX
Explanations
This neuron primarily detects occurrences of the word “Teacher.”
New Auto-Interp
Negative Logits
Compression
-0.07
вп
-0.06
more
-0.06
Colony
-0.06
apult
-0.06
611
-0.06
Pipeline
-0.06
(build
-0.06
Alert
-0.06
REAT
-0.06
POSITIVE LOGITS
teacher
0.13
teachers
0.12
Teacher
0.12
Teachers
0.12
Teacher
0.10
teacher
0.09
tập
0.08
employee
0.08
teachers
0.07
átky
0.07
Activations Density 0.010%