INDEX
Explanations
text related to criminal activities
references to subcultures and social movements
New Auto-Interp
Negative Logits
ãģ¾
-0.49
appropriately
-0.48
],"
-0.46
',"
-0.42
ãĤĭ
-0.42
idth
-0.41
ãĤĮ
-0.41
*.
-0.41
$$$$
-0.40
ãĥ¼ãĥ«
-0.40
POSITIVE LOGITS
however
0.55
meanwhile
0.46
moreover
0.43
also
0.41
asm
0.41
WATCHED
0.41
therefore
0.41
ickr
0.40
though
0.39
NETWORK
0.38
Activations Density 3.953%