Explanations
numerical values and their context within a monetary or quantity framework
Negative Logits
dieß          -0.82
itſelf        -0.77
ainfi         -0.74
dépens        -0.73
nothwendig    -0.72
leaſt         -0.71
vœux          -0.70
enfans        -0.70
lèvres        -0.68
étoient       -0.68
Positive Logits
or            1.01
fucking       0.85
damn          0.83
million       0.82
goddamn       0.78
thousand      0.77
something     0.76
something     0.75
fucking       0.75
hundred       0.73
Activations Density 0.294%