INDEX
Explanations
mentions of religious figures and institutions, particularly related to the Jewish faith
references to the term "rabbis."
New Auto-Interp
Negative Logits
Âł Âł Âł Âł
-0.74
Âł Âł Âł Âł Âł Âł Âł Âł
-0.74
Gutenberg
-0.72
66666666
-0.72
ghazi
-0.71
Vietnam
-0.69
Hurricanes
-0.66
Lisbon
-0.65
Likely
-0.64
Libre
-0.64
POSITIVE LOGITS
rabb
1.12
inical
0.95
etry
0.88
inic
0.82
ÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤ
0.81
enn
0.80
inated
0.80
ety
0.80
itte
0.79
Rabb
0.79
Activations Density 0.015%