INDEX
Explanations
proper nouns related to individuals in different fields such as politics, sports, and entertainment
Negative Logits
Appalach       -0.77
Appalachian    -0.70
Willow         -0.69
··             -0.67
Misty          -0.67
Wyoming        -0.67
Beacon         -0.66
Lexington      -0.65
Eliot          -0.65
ModLoader      -0.64
Positive Logits
iani           1.18
á              1.04
ondo           0.98
iano           0.96
ó              0.96
vez            0.95
aldi           0.95
lez            0.94
otti           0.94
utsch          0.92
Activations Density: 0.209%