INDEX
Explanations
No Explanations Found
New Auto-Interp
Negative Logits
steen
-0.77
boss
-0.72
ibo
-0.70
oser
-0.69
eric
-0.68
bow
-0.68
thood
-0.67
eme
-0.67
isoft
-0.66
Malone
-0.65
POSITIVE LOGITS
glim
0.75
NetMessage
0.70
acad
0.69
glances
0.66
levers
0.65
citiz
0.65
liter
0.64
chairs
0.63
magazines
0.63
territ
0.62
Activations Density 0.000%
No Known Activations
This feature has no known activations.