INDEX
Explanations
words related to transportation, specifically the concept of taking a ride
mentions of "ride" in various contexts related to transportation or experiences
New Auto-Interp
Negative Logits
Seym
-0.83
ongyang
-0.76
ilities
-0.75
mson
-0.72
vernment
-0.71
nomine
-0.71
initions
-0.70
ettings
-0.69
argon
-0.68
mercial
-0.68
POSITIVE LOGITS
ride
1.19
bike
1.13
rides
1.11
Ride
1.03
ride
0.90
boarding
0.88
ridden
0.82
rode
0.81
wright
0.80
train
0.78
Activations Density 0.012%