INDEX
Explanations
numeric and date-related information
New Auto-Interp
Negative Logits
ä¸ĥ
-0.15
Fifth
-0.15
Five
-0.15
_Entry
-0.14
Seventh
-0.14
Fourth
-0.14
Seven
-0.14
ÙĨÙĪÙģ
-0.14
fourth
-0.14
ï¼Ļ
-0.13
POSITIVE LOGITS
1
0.25
0
0.24
2
0.22
3
0.21
.
0.19
<|end_of_text|>
0.19
143
0.17
8
0.17
9
0.17
127
0.17
Activations Density 0.041%