INDEX
Explanations
numerical values or parameters in various contexts
New Auto-Interp
Negative Logits
06
-0.18
906
-0.17
306
-0.17
Five
-0.17
05
-0.17
_five
-0.16
äºĶ
-0.16
ives
-0.16
Fifth
-0.16
/******/
-0.16
POSITIVE LOGITS
7
0.29
8
0.26
seventh
0.26
eighth
0.22
seven
0.22
Seventh
0.20
VII
0.19
eight
0.19
à¥Ń
0.18
July
0.18
Activations Density 0.076%