INDEX
Explanations
references to historical figures and their biographical details
Negative Logits
Folding      -0.07
ocket        -0.06
Fold         -0.06
fold         -0.06
abbr         -0.06
.enterprise  -0.06
hazi         -0.06
inter        -0.06
cente        -0.06
ustin        -0.06
POSITIVE LOGITS
born    0.20
Born    0.17
Born    0.16
born    0.15
-born   0.14
生      0.13
birth   0.12
生的    0.11
native  0.10
生      0.10
Activations Density 0.046%