INDEX
Explanations
names of people or entities that are being accused of something
names of individuals or entities involved in accusations
New Auto-Interp
Negative Logits
æĸ¹
-0.87
antha
-0.71
ONSORED
-0.71
attest
-0.63
largeDownload
-0.62
thus
-0.62
ourney
-0.62
areth
-0.61
redo
-0.60
arious
-0.59
POSITIVE LOGITS
unfairly
0.86
wrongly
0.73
unjust
0.71
failing
0.68
foul
0.67
of
0.66
inaction
0.66
falsely
0.65
cheat
0.65
tics
0.65
Activations Density 0.106%