INDEX
Explanations
references to scratching or related actions
New Auto-Interp
Negative Logits
perc
-0.17
ška
-0.16
Hawk
-0.16
ropa
-0.16
ight
-0.16
elda
-0.15
Wire
-0.15
urm
-0.15
ٳ
-0.15
uitka
-0.14
POSITIVE LOGITS
scratch
0.15
217
0.15
817
0.15
illation
0.15
267
0.15
.dtp
0.14
apter
0.14
Scratch
0.14
scratch
0.14
0.14
Activations Density 0.010%