Explanations
names and places related to territories and government positions
references to specific names and terms related to geography or people
Negative Logits
ndra         -0.79
merce        -0.78
aptic        -0.77
ĺħ           -0.75
ograp        -0.73
ortion       -0.68
McAuliffe    -0.68
ographics    -0.68
pelled       -0.67
rha          -0.67
Positive Logits
Lans         0.97
ecake        0.76
hell         0.74
Barcl        0.71
FORE         0.69
FE           0.67
ãĥĢ          0.67
FIN          0.67
better       0.66
brow         0.66
Activations Density 0.021%