INDEX
Explanations
events involving violence or assaults against individuals, particularly focusing on details like the method and circumstances of the attacks
New Auto-Interp
Negative Logits
.cleanup
-0.15
imo
-0.15
eiusmod
-0.15
tác
-0.14
zase
-0.14
же
-0.14
ourg
-0.13
artık
-0.13
ÑĤакиÑħ
-0.13
kå
-0.13
POSITIVE LOGITS
while
0.38
whilst
0.31
moments
0.31
while
0.31
minutes
0.28
WHILE
0.27
seconds
0.25
_while
0.25
While
0.24
after
0.24
Activations Density 0.285%