INDEX
Explanations
phrases related to legal or formal language
instances of the word "and."
New Auto-Interp
Negative Logits
fur
-0.58
agger
-0.57
Street
-0.57
gie
-0.56
Born
-0.54
uct
-0.53
urg
-0.53
Powered
-0.52
actionDate
-0.51
Ĥİ
-0.50
POSITIVE LOGITS
consequently
1.17
hence
1.07
thus
1.06
therefore
1.06
secondly
1.06
thereby
1.00
furthermore
0.86
vice
0.80
moreover
0.79
expects
0.74
Activations Density 0.380%