INDEX
Explanations
instances where the word "none" is mentioned
phrases emphasizing the concept of "none" or the absence of something
New Auto-Interp
Negative Logits
bledon
-0.72
srf
-0.70
illon
-0.67
widest
-0.64
DA
-0.62
lished
-0.59
urry
-0.59
romy
-0.57
rote
-0.56
gnu
-0.55
POSITIVE LOGITS
conom
0.96
theless
0.85
uther
0.81
xus
0.80
essee
0.79
lust
0.75
except
0.74
galitarian
0.74
Detected
0.73
else
0.70
Activations Density 0.017%