INDEX
Explanations
elements related to craftsmanship and intricate design
New Auto-Interp
Negative Logits
üst
-0.17
ersed
-0.16
genu
-0.15
Stap
-0.14
iltr
-0.14
ниÑĤ
-0.14
patched
-0.14
neutr
-0.13
decom
-0.13
æķ£
-0.13
POSITIVE LOGITS
car
0.35
carve
0.32
carving
0.32
-car
0.31
Car
0.31
carved
0.29
et
0.29
car
0.29
ch
0.27
Car
0.27
Activations Density 0.127%