INDEX
    Explanations

    negations and conditional statements

    New Auto-Interp
    Negative Logits
    esar
    -0.14
    mac
    -0.14
    isable
    -0.14
     underground
    -0.14
    oi
    -0.14
    Ve
    -0.14
    .o
    -0.14
    able
    -0.14
     Ve
    -0.14
    ighet
    -0.13
    POSITIVE LOGITS
    ctors
    0.16
    emento
    0.16
    ahren
    0.16
    akov
    0.16
     Spoon
    0.15
    SSIP
    0.15
    &type
    0.15
    auge
    0.15
    bis
    0.15
    SPARENT
    0.15
    Act Density 0.001%

    No Known Activations