Explanations
references to military actions or conflicts
Negative Logits
Token    Logit
onda     -0.22
ει       -0.15
yer      -0.15
Rein     -0.15
gles     -0.14
riad     -0.14
scal     -0.14
ymb      -0.14
abd      -0.14
пок      -0.14
Positive Logits
Token    Logit
太éĥİ    0.17
CLAIM    0.15
ÃĸL      0.15
Ú©Ø´     0.15
Claims   0.15
claim    0.15
Equ      0.15
лак      0.14
zap      0.14
è        0.14
Activations Density 0.117%
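
A minimal sketch of how a density figure like the one above could be computed, assuming "activation density" means the fraction of tokens on which the feature's activation is above zero. The function name `activation_density`, the `threshold` parameter, and the toy counts are hypothetical and are not taken from the dashboard.

```python
from typing import Sequence


def activation_density(activations: Sequence[float], threshold: float = 0.0) -> float:
    """Return the fraction of tokens whose activation exceeds `threshold`.

    Assumption: a token counts as "active" when its activation is strictly
    greater than the threshold (zero by default).
    """
    if len(activations) == 0:
        raise ValueError("need at least one activation value")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Toy data (made up): 117 active tokens out of 100,000.
toy_activations = [1.0] * 117 + [0.0] * (100_000 - 117)
toy_density = activation_density(toy_activations)
print(f"{toy_density:.3%}")
```

Under this reading, a density of 0.117% would mean the feature fires on roughly 1 token in 850 of the sampled text.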