INDEX
Explanations
proper names and terms in German
New Auto-Interp
Negative Logits
theless
-0.76
compulsion
-0.69
captcha
-0.68
shockingly
-0.64
VIDE
-0.63
bullies
-0.62
microw
-0.61
duplication
-0.60
inarily
-0.60
bonded
-0.60
POSITIVE LOGITS
liga
0.97
Mé
0.90
Ãī
0.88
qui
0.86
Univers
0.85
usalem
0.85
arten
0.81
士
0.81
Ãī
0.81
aceae
0.79
Activations Density 0.154%