INDEX
Explanations
HTML elements and their attributes
HTML tag delimiters
closing tags and structure
New Auto-Interp
Negative Logits
co
-0.71
ma
-0.67
part
-0.66
com
-0.65
po
-0.65
in
-0.64
b
-0.64
a
-0.64
p
-0.64
,
-0.63
POSITIVE LOGITS
myſelf
1.19
itſelf
1.18
themſelves
1.13
himſelf
1.08
raiſ
1.06
)}>
1.05
}}">
1.04
avoient
1.03
étoient
1.03
'}}>
1.02
Activations Density 0.094%