INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     Feelings
    -0.40
     Redesign
    -0.39
     OnInit
    -0.39
     pendek
    -0.37
    loem
    -0.36
    SNE
    -0.36
     Stoppers
    -0.35
     Shaft
    -0.34
     Slug
    -0.34
    şört
    -0.34
    POSITIVE LOGITS
    every
    0.91
     Every
    0.83
    Every
    0.82
     EVERY
    0.72
     every
    0.71
    EVERY
    0.70
    GIVEREF
    0.69
     Ogni
    0.67
    Ogni
    0.66
    GEBURTSDATUM
    0.65
    Act Density 0.021%

    No Known Activations