INDEX
Explanations
sentences with financial or quantitative information
New Auto-Interp
Negative Logits
,
-0.23
:
-0.18
1
-0.17
130
-0.16
,↵
-0.16
Û²
-0.15
Û±
-0.14
leigh
-0.14
180
-0.14
cient
-0.14
POSITIVE LOGITS
00
0.61
95
0.48
50
0.45
oo
0.43
99
0.40
90
0.39
80
0.38
75
0.37
85
0.35
60
0.34
Activations Density 0.032%