INDEX
Explanations
references to individuals and their achievements or notable traits
New Auto-Interp
Negative Logits
ober
-0.15
ubern
-0.15
USTER
-0.14
bron
-0.14
antz
-0.14
Bowling
-0.14
Mare
-0.13
teri
-0.13
bern
-0.13
irst
-0.13
POSITIVE LOGITS
nt
0.16
nu
0.15
ona
0.14
/manage
0.14
PFN
0.14
Kelley
0.14
donc
0.14
ortho
0.14
removeAttr
0.13
zsche
0.13
Activations Density 0.095%