Explanations
specific historical events and figures related to Jewish experiences
Negative Logits
inan         -0.16
939          -0.16
клу          -0.15
istan        -0.14
Paramount    -0.14
communist    -0.14
Republic     -0.14
кому         -0.14
077          -0.14
Norman       -0.14
Positive Logits
ts            0.26
Imperial      0.22
Ts            0.21
Ñ             0.20
-ts           0.20
Russo         0.19
Российской    0.19
imperial      0.19
Petersburg    0.19
_ts           0.18
Activations Density: 0.132%