Explanations
Writing/creation
The neuron detects mentions of student misbehavior or disciplinary problems (e.g. bullying, bullies, troublemakers).
Negative Logits
inet                 -0.06
isn                  -0.06
竞                   -0.06
angement             -0.06
reement              -0.06
couldn               -0.06
(unrendered token)   -0.06
made                 -0.06
fileType             -0.06
moeten               -0.06
Positive Logits
гар                  0.08
[])↵↵                0.07
Settings             0.06
�                    0.06
区                   0.06
ắn                   0.06
CAT                  0.06
PER                  0.06
عباس                 0.06
üzerinden            0.06
Activations Density 0.256%
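
One common reading of an activation density figure like the one above is the fraction of sampled token positions on which the neuron fires at all. The sketch below illustrates that reading only; the function name `activation_density`, the `threshold` parameter, and the toy data are hypothetical and not taken from this page, and the dashboard may compute its figure differently.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of positions whose activation exceeds `threshold`.

    Assumption: "density" means the share of sampled token positions on
    which the neuron is active. This is an illustrative definition, not
    necessarily the one used to produce the reported number.
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    fired = sum(1 for a in activations if a > threshold)
    return fired / len(activations)


# Toy data (made up): 100,000 positions, 256 of which have a nonzero activation.
sample = [0.0] * 99_744 + [1.3] * 256
density = activation_density(sample)
print(f"{density:.3%}")
```

Under this reading, a density well below 1% means the neuron is silent on the vast majority of tokens, which fits a neuron tied to a narrow topic.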