INDEX
Explanations
the word "Perfect" in varying contexts and phrases
the repeated use of the word "Perfect" with varying intensities
New Auto-Interp
Negative Logits
UGH
-0.71
UTH
-0.70
veins
-0.69
Sawyer
-0.66
Cornell
-0.65
externalActionCode
-0.65
epid
-0.63
ITNESS
-0.63
SIL
-0.63
unpublished
-0.62
POSITIVE LOGITS
ion
1.01
entimes
0.87
ity
0.87
erion
0.86
imum
0.86
intendent
0.84
ly
0.84
nesses
0.82
ation
0.82
perfect
0.80
Activations Density 0.009%