INDEX
Explanations
phrases indicating inclusion or references to lists
New Auto-Interp
Negative Logits
label
-0.49
mės
-0.49
固
-0.46
for
-0.45
-0.45
trebui
-0.45
-
-0.44
を受ける
-0.44
Governor
-0.44
http
-0.44
POSITIVE LOGITS
AMONG
1.25
amongst
1.10
Amongst
1.09
Amid
1.08
among
1.05
amidst
1.02
Amid
1.02
among
1.01
amid
0.98
śród
0.97
Activations Density 0.253%