INDEX
Explanations
references to individuals and their roles or actions within a community or organization
New Auto-Interp
Negative Logits
here
-0.22
however
-0.19
acades
-0.17
wherever
-0.17
à¤ĩसम
-0.17
здеÑģÑĮ
-0.16
therefore
-0.16
ivot
-0.15
feit
-0.15
meanwhile
-0.15
POSITIVE LOGITS
upon
0.16
769
0.14
Ãħ
0.14
768
0.14
uk
0.14
perm
0.14
et
0.14
Į
0.13
ëł´
0.13
meets
0.13
Activations Density 0.296%