INDEX
Explanations
elements related to user interface or component attributes in code
New Auto-Interp
Negative Logits
ATAR
-0.18
illac
-0.17
ollow
-0.17
STYLE
-0.17
ást
-0.17
oha
-0.17
agal
-0.15
imore
-0.15
-uppercase
-0.15
леÑĤ
-0.15
POSITIVE LOGITS
w
0.18
flex
0.17
sm
0.17
ire
0.16
wire
0.16
Chandler
0.16
NEXT
0.16
kyt
0.15
w
0.15
classNames
0.15
Activations Density 0.004%