INDEX
Explanations
mentions of physical articles of clothing, specifically belt buckles and belts
references to belts and buckles
New Auto-Interp
Negative Logits
stros
-0.78
ulton
-0.75
ebus
-0.72
berus
-0.71
zanne
-0.68
è£ıç
-0.68
OTOS
-0.68
ophysical
-0.68
*/(
-0.67
perture
-0.67
POSITIVE LOGITS
buckle
1.20
belt
1.05
belt
1.01
belts
0.94
Belt
0.94
tightening
0.88
holder
0.77
stan
0.77
fast
0.77
fed
0.76
Activations Density 0.017%