INDEX
Explanations
phrases related to driving
verbs related to the act of driving
New Auto-Interp
Negative Logits
Seym
-0.80
Lum
-0.78
Flavoring
-0.77
ANN
-0.67
ellen
-0.67
umbn
-0.64
Spir
-0.62
inite
-0.62
ertain
-0.62
achu
-0.61
POSITIVE LOGITS
wheel
1.05
driving
1.04
train
1.00
driving
0.92
bike
0.91
away
0.85
boats
0.84
wagen
0.81
whe
0.80
boat
0.79
Activations Density 0.044%