INDEX
Explanations
terms related to geographical locations or political entities
references to geopolitical entities and groups involved in humanitarian issues
New Auto-Interp
Negative Logits
TAMADRA
-0.64
uated
-0.63
bidden
-0.63
explan
-0.60
rewarded
-0.60
Peb
-0.60
200000
-0.59
Feast
-0.59
Houston
-0.59
Kurd
-0.57
POSITIVE LOGITS
oub
0.76
lance
0.75
casters
0.73
wings
0.72
obs
0.72
resses
0.71
ashes
0.70
isine
0.70
ress
0.69
guards
0.69
Activations Density 0.431%