INDEX
Explanations
pronouns followed by verifiable claims or accusations
references to assertions and claims made by individuals or groups
New Auto-Interp
Negative Logits
hang
-0.64
endif
-0.63
geries
-0.59
Forth
-0.58
asking
-0.58
Fishing
-0.57
intrusion
-0.56
ettlement
-0.55
ancial
-0.55
Vine
-0.54
POSITIVE LOGITS
deems
1.09
believes
0.97
deemed
0.97
dubbed
0.96
termed
0.95
considers
0.94
deem
0.94
alleges
0.90
blames
0.85
ãĤ¹
0.82
Activations Density 0.154%