INDEX
Explanations
mentions of Jewish identity and the diversity within Judaism
New Auto-Interp
Negative Logits
rana
-0.19
andi
-0.16
Salem
-0.15
è͵
-0.15
.ax
-0.15
PROFITS
-0.15
ziej
-0.15
Gunn
-0.15
Carpenter
-0.14
acades
-0.14
POSITIVE LOGITS
Reb
0.28
Lub
0.24
770
0.24
reb
0.23
Crown
0.20
Pose
0.20
lub
0.19
lub
0.19
Sat
0.18
y
0.18
Activations Density 0.038%