INDEX
Explanations
German names or titles
the occurrence of the name "von" in various contexts
Negative Logits
ional       -0.84
procedural  -0.75
UGC         -0.74
atari       -0.71
taboola     -0.67
Canadian    -0.66
mable       -0.66
wives       -0.66
orph        -0.64
ointment    -0.63
Positive Logits
Braun       1.11
neg         0.90
der         0.88
Syd         0.87
Frey        0.87
Kra         0.85
Wer         0.83
Doom        0.82
Stru        0.82
Schwarz     0.81
Activations Density 0.044%