INDEX
Explanations
references to vertical and horizontal orientations in various contexts
New Auto-Interp
Negative Logits
giving
-0.82
mberg
-0.80
REDACTED
-0.80
ãģ®éŃĶ
-0.80
unes
-0.76
ãģ¦
-0.74
IVERS
-0.74
erv
-0.73
nil
-0.72
lyak
-0.72
POSITIVE LOGITS
axis
1.02
stripes
0.92
separation
0.91
takeoff
0.89
ascent
0.88
dimension
0.87
orientation
0.86
leap
0.86
shaft
0.86
vertical
0.85
Activations Density 0.007%