INDEX
Explanations
The neuron spotlights multiword proper names or titles (i.e. sequences of capitalized tokens forming named entities).
New Auto-Interp
Negative Logits
save
-0.08
mối
-0.07
Schwe
-0.07
tanım
-0.07
save
-0.07
-key
-0.07
studi
-0.07
save
-0.06
/<?
-0.06
Detected
-0.06
POSITIVE LOGITS
sublist
0.06
rcode
0.06
odom
0.06
یم
0.06
lights
0.06
KeyUp
0.05
ecera
0.05
oking
0.05
속
0.05
stopping
0.05
Activations Density 0.535%