INDEX
Explanations
information related to the birth dates and places of various individuals
statements about people's birth and existence
New Auto-Interp
Negative Logits
ologies
-0.75
amplification
-0.74
oppers
-0.73
glers
-0.72
stabilization
-0.72
Extend
-0.72
lements
-0.70
HAVE
-0.69
ogether
-0.69
mitigation
-0.68
POSITIVE LOGITS
born
1.61
assassinated
1.07
married
1.02
famed
1.00
baptized
0.98
appointed
0.96
famous
0.95
crowned
0.94
originally
0.94
murdered
0.94
Activations Density 0.220%