Explanations
expressions of speculation, hypothesis, or conjecture
Negative Logits
Token         Logit
itſelf        -0.92
myſelf        -0.91
Efq           -0.86
חיצוניים      -0.85
Cæsar         -0.84
faſt          -0.82
Monfieur      -0.81
ſmall         -0.80
ſever         -0.79
Shakspeare    -0.79
Positive Logits
Token         Logit
perhaps       0.62
possibly      0.61
possibly      0.61
perhaps       0.58
may           0.57
likely        0.57
Possibly      0.56
Perhaps       0.51
Possibly      0.50
Perhaps       0.49
Activations Density 0.479%