INDEX
Explanations
instances of falsely or wrongly claiming or accusing
terms indicating falsehood or deception
New Auto-Interp
Negative Logits
soc
-0.75
soDeliveryDate
-0.69
iments
-0.67
RAW
-0.66
Legendary
-0.65
players
-0.64
player
-0.60
ilion
-0.60
href
-0.59
Reviewer
-0.58
POSITIVE LOGITS
declare
0.93
reproduce
0.93
withdrew
0.91
demonstrate
0.88
engage
0.87
utilize
0.87
proclaim
0.86
resided
0.86
differentiate
0.83
entered
0.83
Activations Density 0.176%