INDEX
Explanations
phrases indicating dedication or commitment to a cause
Negative Logits
Token    Logit
yer      -0.18
/cms     -0.16
Penal    -0.15
ernel    -0.15
alla     -0.15
PAD      -0.15
Dann     -0.14
abin     -0.14
Ã¥l      -0.14
agate    -0.14
Positive Logits
Token     Logit
vo        0.15
ByUrl     0.15
dden      0.14
åľ°ä¸ĭ    0.14
AYER      0.14
_CHAN     0.13
ille      0.13
ÑĤиÑĢов   0.13
çķ¥       0.13
ekim      0.13
Activations Density 0.010%
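
The density figure above can be read as the share of tokens on which the feature fires. The sketch below illustrates that reading only: it assumes density means the fraction of activations strictly above a threshold of zero, which the page itself does not state. The function name `activation_density` and the sample data are hypothetical, not part of any dashboard API.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of activations strictly above `threshold`.

    Assumption: "density" is the share of tokens where the feature is active.
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical sample: the feature fires on 1 token out of 10,000.
sample = [0.0] * 9999 + [1.7]
density = activation_density(sample)
print(f"{density:.3%}")  # prints 0.010%
```

Under this reading, a density of 0.010% corresponds to roughly one active token in every 10,000.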