INDEX
Explanations
indicative verbs conveying ongoing or continuous states
New Auto-Interp
Negative Logits
anje
-0.16
querque
-0.15
ugu
-0.14
ursor
-0.14
ique
-0.14
ilog
-0.14
NOT
-0.14
yeniden
-0.14
iphy
-0.14
onestly
-0.13
POSITIVE LOGITS
ders
0.28
dess
0.16
ocal
0.16
vera
0.15
jsx
0.15
ingly
0.15
alive
0.15
true
0.15
etal
0.15
der
0.14
Activations Density 0.029%