Explanations
instances of monetary values and their representations
Negative Logits
Token     Logit
Anſ       -1.09
Theſe     -1.06
iſt       -1.04
ſelves    -1.00
ſind      -0.98
Reſ       -0.98
―――――     -0.97
itſelf    -0.96
ſy        -0.96
Eſ        -0.94
Positive Logits
Token     Logit
$         1.53
}$        1.23
}}$       1.08
$)        1.01
$)$       0.97
)$        0.97
]$        0.96
}$        0.93
</em>     0.92
\}$       0.91
Activations Density: 0.412%