INDEX
Explanations
phrases that express varying degrees of certainty or emphasis
phrases that indicate a consequence or condition
New Auto-Interp
Negative Logits
ucc
-0.74
guiActiveUnfocused
-0.67
"},"
-0.64
leneck
-0.64
gat
-0.63
ufact
-0.60
robe
-0.59
fare
-0.59
atti
-0.57
ews
-0.56
POSITIVE LOGITS
coupled
1.21
incidentally
1.08
combined
1.00
plus
0.97
however
0.93
alas
0.89
along
0.87
needless
0.84
sadly
0.83
admittedly
0.82
Activations Density 0.078%