INDEX
Explanations
phrases related to race car driving and adjustments during races
New Auto-Interp
Negative Logits
θεν
-0.15
ibbon
-0.15
fare
-0.13
adiens
-0.13
hel
-0.13
تاÙĨ
-0.13
stad
-0.13
tez
-0.13
usted
-0.12
asons
-0.12
POSITIVE LOGITS
grip
0.24
setup
0.23
softer
0.21
setups
0.21
aer
0.21
balance
0.20
soft
0.20
Grip
0.20
degradation
0.19
PU
0.19
Activations Density 0.026%