INDEX
Explanations
names of deceased individuals and their contributions or relationships
New Auto-Interp
Negative Logits
缮åīį
-0.18
ensued
-0.16
вок
-0.16
currently
-0.15
hasn
-0.15
sat
-0.14
aniem
-0.14
haven
-0.14
subst
-0.14
åı¤
-0.14
POSITIVE LOGITS
lived
0.24
died
0.23
dies
0.23
dying
0.22
never
0.21
æŃ»
0.21
Never
0.18
leaves
0.18
never
0.18
-lived
0.17
Activations Density 0.211%