INDEX
Explanations
phrases related to loss or defeat in a competitive context
references to sports losses or political defeats
New Auto-Interp
Negative Logits
ridges
-0.64
monds
-0.60
intakes
-0.60
fir
-0.59
eyed
-0.58
hubs
-0.58
rison
-0.58
FG
-0.58
Redditor
-0.57
intake
-0.56
POSITIVE LOGITS
ingly
0.85
aciously
0.79
achi
0.78
agi
0.74
vich
0.74
ables
0.74
hard
0.73
nces
0.72
acious
0.72
bite
0.72
Activations Density 0.054%