INDEX
Explanations
proper nouns, particularly names and locations related to sports figures and teams
New Auto-Interp
Negative Logits
adera
-0.19
ãĤ¹ãĤ¿ãĥ¼
-0.15
posables
-0.15
inders
-0.15
èį
-0.15
ÄĽr
-0.15
amera
-0.14
-NLS
-0.14
aras
-0.14
ëij¥
-0.14
POSITIVE LOGITS
react
0.21
gestures
0.20
compet
0.19
cele
0.18
warming
0.17
reacts
0.17
celebrate
0.17
looks
0.17
gest
0.17
during
0.17
Activations Density 0.018%