INDEX
Explanations
phrases indicating lack or absence
instances of the word "None"
New Auto-Interp
Negative Logits
bledon
-0.73
widest
-0.69
guiActiveUnfocused
-0.68
cum
-0.67
rod
-0.64
roit
-0.63
romy
-0.63
ciation
-0.60
nas
-0.60
pu
-0.60
POSITIVE LOGITS
essee
0.97
Detected
0.82
lect
0.79
theless
0.73
etting
0.72
uther
0.72
orem
0.71
Ended
0.70
conom
0.67
sane
0.67
Activations Density 0.010%