INDEX
Explanations
names and relevant biographical details of influential figures
New Auto-Interp
Negative Logits
rov
-0.17
Barth
-0.15
ège
-0.14
unnamed
-0.14
782
-0.14
-fold
-0.13
":[{"-0.13
Fold
-0.13
ivic
-0.13
Composition
-0.13
POSITIVE LOGITS
born
0.25
born
0.19
çĶŁ
0.18
çĶŁ
0.17
ìĥĿ
0.16
190
0.15
Born
0.15
çĶŁçļĦ
0.15
ÙĪÙĦد
0.15
192
0.15
Activations Density 0.108%