INDEX
Explanations
instances of the word "Perhaps" followed by a sentence
Negative Logits
lete      -0.86
iya       -0.86
ament     -0.83
cies      -0.78
ombat     -0.75
ieve      -0.74
ieves     -0.73
ocaust    -0.73
atches    -0.73
cium      -0.72
Positive Logits
someday          1.18
misunder         0.90
unsurprisingly   0.88
subconscious     0.86
sensing          0.85
underest         0.81
underestimate    0.78
overest          0.77
somew            0.75
exagger          0.74
Activations Density: 1.139%