INDEX
Explanations
words related to riding things or vehicles
references to riding various modes of transportation, particularly bicycles and horses
New Auto-Interp
Head Attr Weights
0:0.07
1:0.03
2:0.14
3:0.07
4:0.08
5:0.06
6:0.02
7:0.02
8:0.28
9:0.11
10:0.04
11:0.02
Negative Logits
zsche
-1.35
gemony
-1.23
inctions
-1.22
nces
-1.21
distinguishing
-1.17
oteric
-1.17
rences
-1.16
hig
-1.13
gdala
-1.12
��
-1.10
POSITIVE LOGITS
brakes
1.35
roller
1.32
fever
1.28
wagon
1.28
carts
1.27
ousel
1.26
roller
1.26
broom
1.25
heels
1.25
stairs
1.22
Activations Density 0.009%