INDEX
Explanations
concepts related to comparison and value assessment
New Auto-Interp
Negative Logits
erner
-0.15
-translate
-0.14
jal
-0.14
кÑĥл
-0.14
ixel
-0.14
_failure
-0.13
Studio
-0.13
ubb
-0.13
flows
-0.12
å¥ij
-0.12
POSITIVE LOGITS
bishop
0.29
knight
0.29
queens
0.27
bishops
0.27
knights
0.26
Bishop
0.25
pawn
0.24
Knight
0.23
kings
0.23
queen
0.23
Activations Density 0.007%