INDEX
Explanations
phrases indicating assumptions or expectations
expressions of hypothetical situations or conjectures
New Auto-Interp
Negative Logits
Countdown
-0.61
rawdownloadcloneembedreportprint
-0.58
Tonight
-0.57
Enough
-0.56
Leilan
-0.55
lobb
-0.54
Canberra
-0.54
Brill
-0.54
Gutenberg
-0.54
Ready
-0.53
POSITIVE LOGITS
expect
1.31
think
1.23
imagine
1.14
guess
1.05
assume
1.04
suppose
1.04
presume
1.04
wonder
1.00
hope
0.97
suspect
0.96
Activations Density 0.077%