INDEX
Explanations
names of individuals or locations in various contexts
proper nouns, particularly names and institutions
Negative Logits
oka        -1.01
ox         -0.92
LX         -0.87
OX         -0.85
opl        -0.82
omy        -0.82
TAMADRA    -0.81
Norwich    -0.80
WP         -0.79
710        -0.79
Positive Logits
de         1.16
des        1.02
Mul        1.02
Del        1.02
Des        0.99
De         0.98
De         0.96
DE         0.95
Dul        0.94
DEM        0.94
Activations Density: 0.413%