Explanations
future-oriented phrases indicating intention or planned actions
Negative Logits
ento           -0.16
нак            -0.13
alty           -0.13
arf            -0.13
lobals         -0.13
cop            -0.13
involvement    -0.13
otur           -0.13
issa           -0.12
Īĺ             -0.12
Positive Logits
released       0.19
release        0.19
available      0.19
exist          0.18
存在           0.18
_release       0.17
existence      0.17
exists         0.17
_AVAILABLE     0.16
release        0.16
Activations Density 0.170%