INDEX
Explanations
terms related to accusations or claims, particularly of a serious nature
references to accusations or claims made about individuals or organizations
Negative Logits
skill         -0.71
tz            -0.69
PRES          -0.67
patch         -0.64
arton         -0.63
spl           -0.62
kin           -0.62
birth         -0.62
perm          -0.62
ï¸            -0.62
Positive Logits
allegations   1.17
accusations   1.04
allegation    0.96
accusation    0.93
accusing      0.88
alleges       0.86
alleging      0.86
accuser       0.84
accus         0.77
riott         0.76
Activation Density: 0.027%
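
The density figure is not defined on this page. A minimal sketch, assuming it means the fraction of sampled token positions on which the feature's activation is above zero, could look like the following. The function name `activation_density`, the `threshold` parameter, and the sample data are all hypothetical and are not taken from the source.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of token positions whose activation exceeds `threshold`.

    Assumption: "density" means the share of positions where the feature fires.
    """
    if not activations:
        return 0.0
    fired = sum(1 for a in activations if a > threshold)
    return fired / len(activations)


# Hypothetical sample: 10,000 token positions, 3 of which activate the feature.
sample = [0.0] * 9997 + [1.2, 0.4, 3.1]
density = activation_density(sample)
print(f"{density:.3%}")
```

Under this reading, a density of 0.027% would correspond to the feature firing on roughly 27 of every 100,000 token positions.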