INDEX
Explanations
HTML-related tags and attributes
New Auto-Interp
Negative Logits
classes
-0.34
Classes
-0.32
classes
-0.32
-class
-0.32
classname
-0.30
-Class
-0.30
.classes
-0.29
клаÑģÑģ
-0.29
/classes
-0.28
klass
-0.28
POSITIVE LOGITS
data
0.24
style
0.23
style
0.23
role
0.21
aria
0.19
data
0.19
Style
0.19
STYLE
0.18
Style
0.17
id
0.17
Activations Density 0.016%