INDEX
Explanations
intensifiers and adverbs that emphasize certainty or strength
New Auto-Interp
Negative Logits
llib
-0.16
Weaver
-0.16
instr
-0.14
Bea
-0.14
_tF
-0.14
antry
-0.14
rrha
-0.14
oogle
-0.13
arg
-0.13
Ct
-0.13
POSITIVE LOGITS
اÙĦÙī
0.16
ifies
0.15
sprites
0.14
awa
0.14
uma
0.14
ANNOT
0.14
has
0.14
بش
0.14
had
0.14
neg
0.14
Activations Density 0.328%