INDEX
Explanations
recurrent phrases and expressions indicating preference or recommendation
New Auto-Interp
Negative Logits
ufe
-0.14
gly
-0.14
izo
-0.14
atoria
-0.13
vin
-0.13
kel
-0.13
minent
-0.13
.Library
-0.13
quette
-0.13
orrent
-0.13
POSITIVE LOGITS
idata
0.17
.AutoSizeMode
0.15
879
0.15
theless
0.15
olics
0.14
_cpus
0.14
داد
0.14
OSP
0.14
ossa
0.14
CONS
0.14
Activations Density 0.168%