INDEX
    Explanations

    phrases that indicate setting or configuration instructions

    New Auto-Interp
    Negative Logits
    AndEndTag
    -0.93
     ostavi
    -0.88
     myſelf
    -0.87
     يتيمه
    -0.85
     الحره
    -0.82
     Jefus
    -0.82
     perſon
    -0.81
     houſe
    -0.80
     itſelf
    -0.79
     Anſ
    -0.78
    POSITIVE LOGITS
    0.67
    nonatomic
    0.63
    
    0.53
    чив
    0.52
     <
    0.51
    zies
    0.50
     a
    0.48
    írás
    0.48
    indi
    0.47
    hav
    0.47
    Act Density 0.046%

    No Known Activations