INDEX
Explanations
proper nouns from various fields like names of people, places, and organizations
names and terms associated with prominent figures, particularly in politics and entertainment
Negative Logits
Token          Logit
Load           -0.72
subscript      -0.69
Seym           -0.67
Vec            -0.63
Ukrain         -0.63
Wem            -0.61
discrep        -0.61
pecul          -0.61
pestic         -0.58
specificity    -0.58
Positive Logits
Token          Logit
Jr             1.53
Sr             1.13
famously       0.95
III            0.92
enegger        0.86
Jr             0.84
erson          0.77
ervatives      0.74
aka            0.73
's             0.70
Activations Density 0.228%
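
The page does not say how the density figure is computed. A common reading is that it is the fraction of token positions on which the feature is active. The sketch below illustrates that reading only; the function name `activation_density`, the `threshold` parameter, and the sample data are hypothetical and are not taken from the dashboard.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of token positions whose activation exceeds threshold.

    Assumption: "active" means strictly greater than `threshold`;
    the dashboard may use a different cutoff.
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical sample: 1,000 token positions, 2 of which activate the feature.
sample = [0.0] * 998 + [1.7, 0.4]
print(f"{activation_density(sample):.3%}")  # prints 0.200%
```

Under this reading, a density of 0.228% would mean the feature is active on roughly 2 or 3 of every 1,000 tokens.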