INDEX
Explanations
words related to criminal activities, specifically focusing on robbery
terms associated with robbery and criminal activities
New Auto-Interp
Negative Logits
mite
-0.87
ankind
-0.74
minist
-0.72
mitt
-0.72
Reviewer
-0.71
mos
-0.70
Plex
-0.70
mberg
-0.68
UE
-0.68
rolog
-0.67
POSITIVE LOGITS
robbery
1.19
spree
1.13
robberies
1.11
robbing
0.85
robbers
0.83
eering
0.80
robber
0.80
bery
0.78
robbed
0.76
sters
0.74
Activations Density 0.026%