Explanations
proper nouns, specifically names
mentions of locations or geographical identifiers
Negative Logits
plat       -0.68
dece       -0.66
wound      -0.63
sheep      -0.62
leve       -0.60
minster    -0.59
calc       -0.59
deeds      -0.58
Hayward    -0.58
gem        -0.58
Positive Logits
AN          3.90
ANS         2.53
ANI         2.22
ANA         2.05
ANE         2.01
ANN         1.94
ANT         1.86
ans         1.78
ANC         1.70
ANY         1.62
Activations Density 0.012%
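
The page does not say how the density figure is computed. A minimal sketch follows, assuming it is the fraction of token positions at which the feature's activation is above zero. The function name `activation_density`, the `threshold` parameter, and the sample data are all hypothetical and chosen only for illustration.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of positions whose activation exceeds `threshold`.

    Assumption: "density" means the share of token positions where the
    feature fires at all; the source page does not confirm this definition.
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical data: 3 active positions out of 25,000 tokens.
sample = [0.0] * 24_997 + [1.2, 0.4, 3.1]
density = activation_density(sample)
print(f"{density:.3%}")  # prints 0.012%
```

Under this assumption, a density of 0.012% would mean the feature fires on roughly 12 of every 100,000 tokens.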