INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     manque
    -0.08
     eng
    -0.08
    vilupp
    -0.07
    produ
    -0.07
    Constru
    -0.07
    -built
    -0.07
    atil
    -0.07
    frequency
    -0.07
     instantiated
    -0.07
    nown
    -0.07
    POSITIVE LOGITS
     recruiters
    0.08
    .rectangle
    0.08
    .gridy
    0.08
    CCC
    0.08
    ymru
    0.08
     хлоп
    0.08
    ār
    0.08
    0.07
     whisper
    0.07
     IDC
    0.07
    Act Density 0.001%

    No Known Activations