INDEX
Explanations
proper names in text
alphanumeric sequences and abbreviations related to names and titles
New Auto-Interp
Negative Logits
additive
-0.68
prol
-0.64
mosaic
-0.64
IUM
-0.62
brigade
-0.62
VIDEOS
-0.62
conditional
-0.61
samples
-0.61
commute
-0.60
overboard
-0.59
POSITIVE LOGITS
midt
1.04
ä
1.00
ü
0.97
itz
0.93
orthy
0.89
eker
0.89
utter
0.89
orde
0.89
ör
0.88
acker
0.88
Activations Density 0.156%