Explanations
instances of the word "therefore."
Negative Logits
ing      -0.87
er       -0.68
man      -0.67
hol      -0.64
yr       -0.63
ge       -0.62
Sha      -0.58
</em>    -0.58
dup      -0.57
frac     -0.57
Positive Logits
therefore    2.36
Therefore    2.12
Therefore    2.08
therefore    2.04
therefor     1.85
Поэтому      1.56
Daarom       1.47
Derfor       1.46
Portanto     1.46
Deshalb      1.40
Activations Density 0.078%