INDEX
Explanations
instances of the word "single" followed by a numeric value or the word "one"
New Auto-Interp
Negative Logits
eln
-0.78
ello
-0.76
iosyn
-0.73
olas
-0.73
aba
-0.68
enaries
-0.68
srfAttach
-0.68
HCR
-0.67
ÃĥÃĤ
-0.66
ctr
-0.66
POSITIVE LOGITS
THING
1.22
imaginable
1.04
inch
0.96
conceivable
0.96
WHERE
0.96
penny
0.96
thing
0.96
goddamn
0.92
ounce
0.91
facet
0.90
Activations Density 0.030%