INDEX
Explanations
declarations related to the identity and status of regions or entities
New Auto-Interp
Negative Logits
ancel
-0.17
omba
-0.16
atten
-0.15
amus
-0.14
eba
-0.14
биÑĤ
-0.14
nez
-0.14
ifen
-0.14
ugas
-0.14
amage
-0.13
POSITIVE LOGITS
home
0.34
home
0.23
-home
0.22
host
0.22
sede
0.21
Home
0.19
Home
0.19
(home
0.19
/home
0.19
天åłĤ
0.18
Activations Density 0.099%