INDEX
Explanations
proper nouns related to locations or entities
proper nouns, particularly names of places and people
New Auto-Interp
Negative Logits
fertile
-0.71
opposite
-0.67
direction
-0.65
wealthy
-0.63
dime
-0.60
foreseeable
-0.60
GF
-0.59
richer
-0.59
powerful
-0.58
lifetime
-0.58
POSITIVE LOGITS
theless
1.16
gomery
1.09
anyahu
1.08
agascar
1.08
terday
1.03
tenance
0.99
Berry
0.97
mosp
0.95
odore
0.95
withstanding
0.94
Activations Density 0.334%