Explanations
mentions of the word "Sultan."
Sultan Abdulhamid II
Negative Logits
Token          Logit
estekak        -0.40
noastre        -0.39
Goed           -0.38
gemaakt        -0.36
travaillons    -0.33
break          -0.31
デイ           -0.31
kyard          -0.31
Past           -0.30
اح             -0.30
Positive Logits
Token          Logit
Sultan         2.52
Sultan         2.31
sultan         2.05
Sult           1.40
sult           1.23
ultan          1.09
سلط            1.02
السلط          0.86
Sullivan       0.81
SUL            0.75
Activations Density: 0.001%