INDEX
Explanations
terms related to residency and resident status
New Auto-Interp
Negative Logits
gaard
-0.15
rud
-0.15
osy
-0.15
onomous
-0.15
lef
-0.15
PRS
-0.15
oq
-0.14
iba
-0.14
ascus
-0.14
ippi
-0.14
POSITIVE LOGITS
Evil
0.32
evil
0.30
evil
0.29
evi
0.22
Ev
0.18
_ev
0.17
_fonts
0.16
EV
0.16
Evans
0.16
evils
0.16
Activations Density 0.003%