Explanations
words related to violence and violations
terms related to violations and legal issues
Negative Logits
dress                 -0.77
mares                 -0.76
ointment              -0.74
guiActiveUnfocused    -0.69
erald                 -0.68
washing               -0.62
Ħ¢                    -0.61
mare                  -0.61
nect                  -0.61
wealth                -0.59
Positive Logits
viol                  1.05
violin                0.91
atis                  0.90
amental               0.88
Viol                  0.88
atos                  0.87
encies                0.85
atio                  0.83
ace                   0.82
isions                0.82
Activations Density 0.017%