INDEX
Explanations
phrases indicating outcomes or results
forms of the verb "to be" and related status descriptors
New Auto-Interp
Negative Logits
ourt
-0.64
inas
-0.64
EMBER
-0.64
cit
-0.62
volent
-0.62
iazep
-0.61
acist
-0.59
trop
-0.59
laughs
-0.59
conserv
-0.58
POSITIVE LOGITS
therefore
0.93
also
0.93
however
0.82
definitely
0.81
certainly
0.79
undoubtedly
0.77
furthermore
0.75
doubtless
0.72
moreover
0.72
not
0.72
Activations Density 0.863%