INDEX
Explanations
modal verbs indicating obligation or necessity
New Auto-Interp
Negative Logits
elle
-0.19
Lifecycle
-0.18
asion
-0.17
ereo
-0.15
elles
-0.15
ideo
-0.15
poz
-0.15
utherford
-0.14
osity
-0.14
Ñĥм
-0.14
POSITIVE LOGITS
.toolbox
0.15
ring
0.15
éĵ
0.14
ring
0.14
éı
0.14
éĬ
0.14
upa
0.14
fix
0.14
synd
0.14
ivan
0.14
Activations Density 0.000%