INDEX
Explanations
adjectives and adverbs that convey intensity or nuance in descriptions
New Auto-Interp
Negative Logits
olumn
-0.17
airs
-0.17
optic
-0.16
erk
-0.16
ibur
-0.15
ÌĢ
-0.15
reamble
-0.15
velte
-0.15
iciel
-0.15
dür
-0.15
POSITIVE LOGITS
ly
1.74
LY
1.05
ÑģÑı
0.65
lys
0.60
äºİ
0.60
lya
0.50
liness
0.48
lyn
0.44
ness
0.42
ãģ«
0.42
Activations Density 0.292%