INDEX
Explanations
proper nouns, specifically names and locations
references to locations and organizations, particularly in a news context
New Auto-Interp
Negative Logits
ively
-0.73
pity
-0.67
iveness
-0.65
uncture
-0.65
ulse
-0.65
Akron
-0.64
orship
-0.64
skin
-0.62
esthetic
-0.61
saf
-0.61
POSITIVE LOGITS
BER
1.38
LIN
1.29
GER
1.14
ertodd
1.09
RY
1.05
keley
1.05
EGIN
0.99
SU
0.99
ļéĨĴ
0.97
LAND
0.96
Activations Density 0.011%