Explanations
terms related to rotational mechanics and movements in machinery
Negative Logits
inter     -0.52
mun       -0.46
cuito     -0.44
et        -0.44
          -0.42
patch     -0.42
imp       -0.42
patch     -0.42
em        -0.42
ophy      -0.41
Positive Logits
rotate    1.11
rotates   1.10
Rotate    1.09
rotating  1.03
Rotating  1.01
swing     1.00
Rotating  0.99
swings    0.98
rotated   0.97
rotated   0.97
Activations Density: 0.814%