INDEX
Explanations
references to violence and supernatural elements
New Auto-Interp
Negative Logits
writeFieldEnd
-0.46
мәкалә
-0.45
propertyName
-0.43
👔
-0.43
ContentLoaded
-0.40
šnje
-0.38
COA
-0.36
Corso
-0.36
honore
-0.36
adaptiveStyles
-0.35
POSITIVE LOGITS
attack
0.52
fighting
0.51
frontal
0.50
offensive
0.50
confrontation
0.50
confronting
0.49
attacking
0.48
ата
0.46
serangan
0.46
battle
0.46
Activations Density 0.120%