INDEX
Explanations
words related to uncertainty or speculation
the phrase "may have" in various contexts
New Auto-Interp
Negative Logits
anding
-0.73
Crown
-0.70
ament
-0.59
reinforcement
-0.59
osi
-0.58
etition
-0.58
Harlem
-0.58
dding
-0.57
Tribune
-0.56
osa
-0.56
POSITIVE LOGITS
been
1.16
gotten
1.13
gotten
1.08
been
0.99
gone
0.92
underestimated
0.88
slipped
0.87
fallen
0.85
fooled
0.85
stumbled
0.85
Activations Density 0.046%