INDEX
Explanations
instances of the phrase "I am."
New Auto-Interp
Negative Logits
stants
-0.17
INC
-0.16
uhl
-0.16
ignKey
-0.16
inde
-0.15
INCT
-0.15
batim
-0.15
amba
-0.15
Uz
-0.14
cé
-0.14
POSITIVE LOGITS
buflen
0.19
\API
0.15
isor
0.15
ç¢İ
0.14
ieber
0.14
heed
0.14
QUERY
0.14
atha
0.13
olate
0.13
mah
0.13
Activations Density 0.004%