INDEX
    Explanations

    the word "train" and words ending in "en"

    New Auto-Interp
    Negative Logits
    )++;
    -0.52
     C
    -0.52
     bien
    -0.49
     &___
    -0.48
    спользова
    -0.48
    "");
    -0.47
    })();
    
    -0.47
     Get
    -0.47
    <eos>
    -0.47
    ';
    -0.47
    POSITIVE LOGITS
     myſelf
    0.87
     itſelf
    0.86
     feroit
    0.85
     sû
    0.84
    ScopeManager
    0.80
     againſt
    0.79
     originaux
    0.79
     médicaux
    0.74
    VersionUID
    0.73
     uſed
    0.72
    Act Density 0.297%

    No Known Activations