INDEX
Explanations
names of people and organizations
proper nouns, particularly names and organizations
Negative Logits
Token        Logit
20439        -0.65
chel         -0.61
..."         -0.53
berra        -0.53
Redd         -0.50
/_           -0.49
>>>>         -0.49
strument     -0.49
uff          -0.49
chest        -0.48
Positive Logits
Token          Logit
espie          0.56
rul            0.54
fared          0.54
detractors     0.53
responded      0.49
countered      0.48
blinked        0.48
govtrack       0.48
Facts          0.47
experimented   0.46
Activations Density 0.934%