INDEX
Explanations
instances of the word "barely"
New Auto-Interp
Negative Logits
å©Ĩ
-0.15
-ÑĤо
-0.15
wend
-0.15
ilir
-0.15
swick
-0.14
peed
-0.13
erm
-0.13
åĨħéĥ¨
-0.13
sWith
-0.13
Cotton
-0.13
POSITIVE LOGITS
oly
0.17
mina
0.16
agues
0.16
Doll
0.15
ihan
0.15
offs
0.15
-www
0.15
ụ
0.14
lez
0.14
borg
0.14
Activations Density 0.002%