INDEX
Explanations
elements related to structured organization and positions within a system
New Auto-Interp
Negative Logits
rise
-0.15
ToFront
-0.15
],[-
-0.15
ÑĪин
-0.14
borough
-0.14
backs
-0.14
heads
-0.14
uman
-0.13
inland
-0.13
swire
-0.13
POSITIVE LOGITS
bottom
1.13
Bottom
0.99
bottom
0.94
Bottom
0.92
BOTTOM
0.87
-bottom
0.86
_bottom
0.80
bottoms
0.79
.bottom
0.75
BOTTOM
0.74
Activations Density 0.123%