INDEX
Explanations
phrases indicating possibility or likelihood
speculative statements or expressions of possibility
New Auto-Interp
Negative Logits
ament
-0.82
cies
-0.73
Lauder
-0.73
Constant
-0.71
Leah
-0.67
Palest
-0.65
shall
-0.60
oric
-0.60
rett
-0.60
cedented
-0.60
POSITIVE LOGITS
someday
1.19
ily
1.17
haps
1.04
feas
1.01
iest
1.00
conce
0.98
offend
0.95
be
0.94
plaus
0.93
hap
0.88
Activations Density 0.053%