INDEX
Explanations
phrases related to expressing opinions or positions
end punctuation marks, particularly periods, in statements conveying certainty
New Auto-Interp
Negative Logits
ãĥĻ
-0.66
jugg
-0.66
conceal
-0.62
utical
-0.62
ãĤ©
-0.62
prey
-0.61
steadily
-0.61
disappear
-0.61
sneak
-0.61
curtain
-0.60
POSITIVE LOGITS
Specifically
0.87
Writing
0.87
``
0.86
Saying
0.75
According
0.75
Asked
0.74
"[
0.74
However
0.73
Called
0.72
Speaking
0.72
Activations Density 0.425%