    Negative Logits
    Token    Logit
    act      -0.89
    ect      -0.68
    ACT      -0.67
    ec       -0.60
    cl       -0.58
    id       -0.58
    ist      -0.52
    ho       -0.51
    ide      -0.50
    a        -0.49
    Positive Logits
    (␣ marks a leading space in the token; \n marks a trailing newline)
    Token           Logit
    ]")]            1.19
    ␣Normdatei      1.13
    )");\n          1.12
    $.\n            1.11
    ScopeManager    1.09
    '},\n           1.09
    "):\n           1.07
    ')")            1.05
    .)}             1.03
    ␣"];            1.01
    Activation Density: 0.212%

    No Known Activations