INDEX
Explanations
references to religious titles and roles within the Jewish community
New Auto-Interp
Negative Logits
cete
-0.15
ilia
-0.15
avia
-0.15
uzu
-0.15
sns
-0.14
grese
-0.14
yre
-0.14
orio
-0.14
nika
-0.14
nton
-0.14
POSITIVE LOGITS
bin
0.31
bi
0.29
ble
0.25
bits
0.24
ban
0.23
би
0.21
bon
0.20
shake
0.20
bin
0.19
BLE
0.19
Activations Density 0.004%