Explanations
words related to physical activities or tasks
phrases related to strategy and progression in competition or games
Negative Logits
ortium        -0.60
named         -0.57
Cosponsors    -0.56
Revision      -0.52
çīĪ           -0.52
published     -0.52
irteen        -0.50
buster        -0.49
gov           -0.48
oslov         -0.48
Positive Logits
crappy          0.60
predetermined   0.58
hassle          0.55
crap            0.53
yourself        0.53
tasty           0.53
boring          0.53
sweaty          0.53
desired         0.52
meal            0.52
Activations Density: 2.964%