INDEX
Explanations
words related to racing or competitive sports events
Negative Logits
porr     -0.15
strup    -0.15
OLOR     -0.15
IENT     -0.15
lux      -0.14
orra     -0.14
itsu     -0.14
AMENT    -0.14
.pref    -0.14
_RPC     -0.14
Positive Logits
icator    0.16
hei       0.15
ework     0.14
ander     0.14
acy       0.14
standing  0.13
Pres      0.13
rema      0.13
DMI       0.13
FAG       0.13
Activations Density: 0.108%