Explanations
monetary values and numerical data related to financial performance
NEGATIVE LOGITS
_rewrite      -0.16
urgeon        -0.16
fortune       -0.15
ãĥªãĥ³ãĤ°     -0.15
fort          -0.14
TypeInfo      -0.14
áz            -0.14
Robinson      -0.14
·¨            -0.14
Highlander    -0.14
POSITIVE LOGITS
58            0.62
59            0.57
581           0.49
57            0.48
580           0.47
582           0.46
583           0.46
585           0.45
579           0.45
584           0.45
Activations Density 0.030%