INDEX
Explanations
words and phrases that express high intensity or emphasis
New Auto-Interp
Negative Logits
owie
-0.16
mana
-0.14
voir
-0.14
orde
-0.14
486
-0.13
ائÙĦ
-0.13
etwork
-0.13
Fist
-0.13
ALAR
-0.13
anian
-0.13
POSITIVE LOGITS
zik
0.16
abus
0.16
Instr
0.15
à¸Ńà¸ļ
0.14
리ìĹIJ
0.14
ноп
0.14
.BLL
0.14
ÙĤاÙħ
0.14
aepernick
0.14
ritel
0.14
Activations Density 0.013%