INDEX
Explanations
names of various individuals, potentially related to sports
proper nouns, particularly names of people and places
Negative Logits
Gloria      -0.77
Dorothy     -0.68
PBS         -0.66
Wendy       -0.65
Linda       -0.65
Nobel       -0.65
Watergate   -0.64
izoph       -0.63
Karen       -0.63
RTX         -0.62
Positive Logits
struggled   1.14
averaged    1.11
underwent   1.08
hasn        1.07
possesses   1.06
scored      1.05
excel       1.03
wears       1.03
played      1.02
suffers     1.00
Activations Density 0.203%