INDEX
Explanations
phrases related to German terms or titles
references to the German magazine "Der Spiegel"
New Auto-Interp
Negative Logits
Intermediate
-0.69
zzi
-0.64
hearted
-0.64
Holmes
-0.62
coincidence
-0.60
eous
-0.60
orsi
-0.59
Yemen
-0.59
XL
-0.58
Elephant
-0.57
POSITIVE LOGITS
iving
1.17
isively
1.13
bys
1.04
ivation
1.01
isive
0.99
ived
0.98
ision
0.98
mal
0.96
ricks
0.94
anged
0.93
Activations Density 0.031%