INDEX
Explanations
names of locations and institutions
Negative Logits
ĸļ              -0.91
DERR            -0.76
CLASSIFIED      -0.74
McDonnell       -0.74
bourg           -0.73
ABE             -0.73
éĹĺ             -0.71
tenance         -0.69
Commonwealth    -0.68
士              -0.67
Positive Logits
uana            1.19
Ig              0.99
uala            0.93
omez            0.92
reg             0.88
agi             0.86
les             0.86
uno             0.85
Nob             0.83
min             0.83
Activations Density: 0.006%