INDEX
Explanations
mathematical formulas and symbols.
mathematical formulas
New Auto-Interp
Negative Logits
D
-0.50
Rom
-0.50
<bos>
-0.50
sig
-0.48
Z
-0.44
Y
-0.43
Push
-0.43
romas
-0.43
blo
-0.43
drawing
-0.42
POSITIVE LOGITS
Personendaten
0.90
Anſ
0.78
Majefty
0.77
ſtate
0.76
Diſ
0.76
poffe
0.75
fubject
0.74
myſelf
0.73
ſeveral
0.73
defire
0.72
Activations Density 0.344%