INDEX
Explanations
words signaling conflicting or contrasting information
occurrences of the conjunction "Or" indicating alternatives or options
Negative Logits
enment               -0.71
"}],"                -0.70
UD                   -0.64
"],"                 -0.63
arthed               -0.62
natureconservancy    -0.60
encers               -0.60
Enhancement          -0.59
âĢİ                  -0.58
Ĥİ                   -0.57
Positive Logits
thodox               1.24
phans                1.24
maybe                1.21
lando                1.18
phan                 1.17
perhaps              1.04
alternatively        1.01
chard                0.99
acular               0.97
anges                0.96
Activations Density: 0.050%