INDEX
Explanations
references to serious accusations or charges
mentions of allegations
New Auto-Interp
Negative Logits
keys
-0.73
busiest
-0.70
oppy
-0.70
ARCH
-0.70
patch
-0.69
tz
-0.69
skill
-0.66
arton
-0.65
å¸
-0.64
kin
-0.63
POSITIVE LOGITS
allegations
1.10
accusations
0.95
alleging
0.95
allegation
0.93
alleges
0.91
accusation
0.89
levied
0.87
leveled
0.85
accusing
0.83
leve
0.77
Activations Density 0.037%