Explanations
conjunctions or words connected to cause and effect relationships
conjunctions that indicate causative or consequent relationships
Negative Logits
estern      -0.72
lication    -0.71
Bastard     -0.69
ワン        -0.66
Latest      -0.62
ochet       -0.62
iren        -0.62
Own         -0.62
wic         -0.60
ATT         -0.60
Positive Logits
prevents      1.39
reduces       1.30
undermines    1.28
enhances      1.27
hinder        1.27
lessen        1.23
inhibits      1.22
enables       1.21
discourage    1.20
exacerbate    1.19
Activations Density 0.279%