INDEX
    Explanations

    words related to clothing and fabric items

    Negative Logits
    ']));
    -0.52
    TargetException
    -0.50
    )});
    -0.50
    "]);
    
    -0.49
    ']],
    -0.49
    "]);
    -0.48
    "]));
    -0.48
    ]");
    -0.47
    ]]);
    -0.47
    UnitTesting
    -0.47
    Positive Logits
     scarf
    1.95
     scarves
    1.79
     Scarf
    1.69
     handkerchief
    1.00
    charpe
    0.96
     bandana
    0.93
     shawl
    0.88
     handker
    0.86
    🧣
    0.75
     Towel
    0.75
    Activation Density 0.002%

    No Known Activations