INDEX
Explanations
names of historical or fictional figures, particularly those associated with religious or cultural significance
names and references related to historical or notable figures
Negative Logits
Roosevelt   -0.95
ADV         -0.81
Cogn        -0.81
Fram        -0.78
Candy       -0.77
Carmen      -0.73
ogn         -0.72
artisan     -0.70
Okinawa     -0.69
Guam        -0.69
Positive Logits
Moses       2.25
Bolt        1.83
Isaac       1.60
bolt        1.33
Gibbs       1.31
bolt        1.27
bolts       1.18
Fir         1.17
bane        1.15
Isa         1.12
Activations Density: 0.032%