Explanations
references to specific places and events associated with Middle Eastern culture and politics
Negative Logits
itary    -0.17
udi      -0.17
tes      -0.16
Jord     -0.14
ssa      -0.14
Lac      -0.13
ÏĮÏģ     -0.13
ude      -0.13
Tight    -0.13
REE      -0.13
Positive Logits
ahren    0.19
Savage   0.15
Roe      0.15
_TIM     0.15
arent    0.15
łí       0.14
antal    0.14
HAM      0.14
Bieber   0.14
ürn      0.14
Activations Density: 1.403%