Explanations
phrases related to competitive sports, specifically focusing on individual player performances and game strategies
Negative Logits
ndum          -0.69
arij          -0.66
htaking       -0.65
dinand        -0.64
Flavoring     -0.63
andise        -0.63
ijn           -0.61
issance       -0.61
IFIED         -0.60
Dou           -0.59
Positive Logits
owment        1.25
angering      1.22
ocrin         0.92
angered       0.90
ocrine        0.90
eared         0.86
angers        0.86
urance        0.85
anger         0.84
omet          0.82
Activations Density: 0.026%