INDEX
Explanations
elements related to formatting and style in texts, particularly bold and italics
New Auto-Interp
Negative Logits
rary
-0.17
ession
-0.15
elez
-0.15
ehir
-0.15
usk
-0.14
onomous
-0.14
aniel
-0.14
.lu
-0.14
igue
-0.14
enberg
-0.14
POSITIVE LOGITS
bold
0.33
italic
0.30
Bold
0.29
bold
0.29
italic
0.28
Italic
0.28
Ital
0.27
-bold
0.26
Bold
0.26
underline
0.25
Activations Density 0.048%