INDEX
Explanations
mentions of victims or victorious events
terms related to victims and victimization
Negative Logits
Shank        -0.70
backer       -0.68
waivers      -0.68
grounds      -0.63
WAYS         -0.63
Factor       -0.60
Kin          -0.59
Hath         -0.59
reservation  -0.59
marrow       -0.58
Positive Logits
orian        1.73
orious       1.63
oire         1.58
ory          1.57
orians       1.54
oria         1.53
orical       1.48
orius        1.39
ories        1.38
oric         1.37
Activations Density 0.035%