INDEX
Explanations
locations or geographical contexts within texts
New Auto-Interp
Negative Logits
ADX
-0.16
estatus
-0.15
-UA
-0.15
ÄįÃŃ
-0.14
ISM
-0.14
PSA
-0.14
Pearce
-0.14
ushman
-0.14
ylland
-0.14
/misc
-0.14
POSITIVE LOGITS
cales
0.18
scales
0.15
genera
0.15
rio
0.14
riers
0.14
orc
0.14
ayout
0.14
Fade
0.13
phot
0.13
bald
0.13
Activations Density 0.054%