INDEX
Explanations
names of specific locations or entities
specific names and terms related to locations and organizations
New Auto-Interp
Negative Logits
pand
-0.84
ptions
-0.83
cised
-0.77
sembly
-0.73
microscope
-0.65
YEAR
-0.64
Monkey
-0.62
ption
-0.62
wrapper
-0.62
Panda
-0.61
POSITIVE LOGITS
gow
0.96
orough
0.92
Ô
0.88
iana
0.82
nih
0.79
Osh
0.78
Lynd
0.75
sburgh
0.74
ijn
0.73
Pike
0.72
Activations Density 0.017%