INDEX
Explanations
instances where the word "perhaps" is used
instances of the word "perhaps"
Negative Logits
lete       -0.73
ocene      -0.73
arthed     -0.72
tains      -0.72
vance      -0.71
ulative    -0.70
elight     -0.69
yers       -0.68
eworld     -0.68
ved        -0.67
Positive Logits
unsurprisingly    0.80
"$:/              0.76
sensing           0.75
unsus             0.73
amen              0.73
allev             0.72
irrit             0.71
opio              0.70
ironically        0.70
occas             0.68
Activations Density 0.016%