INDEX
Explanations
references to Jewish people and their historical contexts
Negative Logits
Shakspeare    -0.78
Jefus         -0.77
Theſe         -0.71
Anſ           -0.71
Houſe         -0.70
itſelf        -0.68
Diſ           -0.68
Majefty       -0.68
myſelf        -0.68
Phry          -0.68
Positive Logits
Jewish        1.25
Israel        1.08
Jews          1.08
Israeli       1.06
Jewish        1.05
jewish        0.94
Aviv          0.94
Israel        0.93
Jews          0.89
Rabbi         0.88
Activations Density 0.353%