INDEX
    Explanations

    references to transformation or improvement in various contexts

    New Auto-Interp
    Negative Logits
     houſe
    -0.70
     myſelf
    -0.67
     pleaſure
    -0.62
     Efq
    -0.58
     fhew
    -0.58
     enfans
    -0.57
     itſelf
    -0.56
     Eſ
    -0.55
    ſelves
    -0.54
     Monfieur
    -0.53
    POSITIVE LOGITS
    migrationBuilder
    0.64
    0.62
    LookAnd
    0.58
     становника
    0.57
    interopRequire
    0.55
    したのが
    0.53
     backward
    0.52
    LEncoder
    0.52
    ftagPool
    0.51
     steroids
    0.51
    Act Density 0.382%

    No Known Activations