INDEX
Explanations
complex reports or essays
This neuron detects mentions of school-related terms, particularly “school” and its associated staff or campus contexts.
Negative Logits
ographically   -0.07
��   -0.06
tolerate   -0.06
τους   -0.06
tic   -0.06
адки   -0.06
ับส   -0.06
helping   -0.06
attacks   -0.06
strategic   -0.06
Positive Logits
tutar   0.07
Nová   0.07
栏   0.06
behind   0.06
Üst   0.06
_Block   0.06
GD   0.06
olay   0.06
postav   0.06
detective   0.06
Activations Density 0.153%