INDEX
Explanations
proper names and titles, particularly with 'von'
instances of the name "von" in various contexts
New Auto-Interp
Negative Logits
gallery
-0.84
taboola
-0.78
rentice
-0.70
orative
-0.69
inates
-0.68
uyomi
-0.68
ij士
-0.67
ा
-0.67
à¥
-0.67
inals
-0.66
POSITIVE LOGITS
Braun
0.85
Frey
0.85
amins
0.73
env
0.72
hof
0.72
Doom
0.69
der
0.69
Karma
0.68
wald
0.68
Schwarz
0.68
Activations Density 0.019%