INDEX
Explanations
references to significant historical figures and events related to Jewish history and persecution
New Auto-Interp
Negative Logits
ingo
-0.17
esign
-0.16
erti
-0.16
æľĹ
-0.15
Gi
-0.14
illez
-0.14
RAFT
-0.14
Gee
-0.14
awah
-0.14
udi
-0.14
POSITIVE LOGITS
Bund
0.24
Eastern
0.18
bund
0.17
룬
0.16
оÑĩной
0.16
pog
0.16
illo
0.15
Olsen
0.15
Eastern
0.15
770
0.15
Activations Density 0.029%