INDEX
Explanations
references to war and conflict, including specific terms associated with warfare and battles
Negative Logits
estruction    -0.17
žila          -0.16
oo            -0.16
ople          -0.15
chen          -0.15
itsu          -0.15
rossover      -0.14
elsing        -0.14
ernal         -0.14
OPLE          -0.14
Positive Logits
zone          0.17
lord          0.17
lords         0.16
cci           0.14
AVA           0.14
against       0.14
ÚĨÙĩ          0.14
æľ«           0.13
Against       0.13
blers         0.13
Activations Density 0.044%