INDEX
Explanations
proper nouns or names
proper names, particularly those of individuals
New Auto-Interp
Negative Logits
psons
-0.81
culosis
-0.80
otype
-0.74
isation
-0.73
arial
-0.71
iator
-0.70
early
-0.70
ization
-0.69
wagon
-0.68
etheless
-0.68
POSITIVE LOGITS
Bass
0.88
Shack
0.85
Hug
0.82
Poll
0.81
Guard
0.81
De
0.80
Mash
0.79
Bot
0.78
Rum
0.77
Disc
0.77
Activations Density 0.013%