INDEX
Explanations
numerical data and formatting information within the text
New Auto-Interp
Negative Logits
Fifth
-0.31
fifth
-0.31
five
-0.30
Five
-0.30
äºĶ
-0.30
äºĶ
-0.29
_five
-0.28
five
-0.28
5
-0.26
Five
-0.26
POSITIVE LOGITS
8
0.36
7
0.30
eighth
0.26
9
0.25
ï¼ĺ
0.23
eight
0.23
Eighth
0.23
Û¸
0.23
seventh
0.22
८
0.22
Activations Density 0.062%