INDEX
Explanations
punctuation marks, particularly periods and exclamation points
New Auto-Interp
Negative Logits
iba
-0.19
/Dk
-0.17
AccessException
-0.16
Ventures
-0.15
igor
-0.15
ëį°ìĿ´íĬ¸
-0.15
ifold
-0.14
umph
-0.14
atatype
-0.14
Fang
-0.14
POSITIVE LOGITS
rist
0.19
rio
0.16
orc
0.16
stu
0.15
UNC
0.15
agog
0.14
Administrator
0.14
Ric
0.14
unc
0.14
=".$_
0.14
Activations Density 0.002%