INDEX
Explanations
causal relationships or reasoning in text through the use of the word "hence."
the word "hence" used in various contexts
New Auto-Interp
Negative Logits
Tasman
-0.63
hitter
-0.62
abies
-0.61
estation
-0.59
batter
-0.59
Bull
-0.58
Bastard
-0.58
>>>>
-0.58
battered
-0.57
Fram
-0.57
POSITIVE LOGITS
forth
1.92
forward
1.37
entimes
0.82
far
0.79
apy
0.78
comings
0.77
pend
0.77
hua
0.75
apers
0.75
noon
0.73
Activations Density 0.009%