INDEX
Explanations
the word "pretty" used in various contexts to describe a favorable impression
New Auto-Interp
Negative Logits
somewhat
-0.16
/OR
-0.16
roma
-0.16
icerca
-0.16
polit
-0.15
ãĤĪãĤĬ
-0.15
ulumi
-0.14
odnÃŃ
-0.14
-0.14
exceedingly
-0.14
POSITIVE LOGITS
-ÑĤаки
0.26
much
0.21
darn
0.20
close
0.20
758
0.18
-close
0.17
much
0.16
close
0.16
itz
0.16
assin
0.16
Activations Density 0.021%