INDEX
Explanations
statements about government policies and community dynamics
New Auto-Interp
Negative Logits
uj
-0.16
ERO
-0.15
Morrison
-0.15
Weiner
-0.14
angep
-0.14
OT
-0.14
Janeiro
-0.14
erif
-0.13
æ··
-0.13
Samples
-0.13
POSITIVE LOGITS
similarly
0.22
ignum
0.18
likewise
0.18
ebenfalls
0.16
Similarly
0.15
á»ģ
0.15
uzzi
0.15
Similarly
0.14
otime
0.14
iver
0.14
Activations Density 0.259%