INDEX
Explanations
HTML tags and structural elements within the document
New Auto-Interp
Negative Logits
imb
-0.16
aring
-0.14
δεÏĤ
-0.14
isten
-0.14
ICODE
-0.14
<small
-0.14
isman
-0.14
umin
-0.14
fait
-0.14
>Show
-0.13
POSITIVE LOGITS
id
0.26
>↵
0.22
align
0.21
style
0.21
div
0.18
role
0.18
>↵↵
0.18
Style
0.17
idual
0.17
>↵
0.16
Activations Density 0.014%