INDEX
    Explanations

    specific code structure and variable definitions

    After punctuation or symbols

    New Auto-Interp
    Negative Logits
     Aniston
    -0.54
    stø
    -0.49
    führt
    -0.48
    ั่ว
    -0.44
     Ashford
    -0.43
    zuführen
    -0.43
    teine
    -0.42
     Noël
    -0.41
     simplifié
    -0.41
     Neuer
    -0.41
    POSITIVE LOGITS
    reg
    1.11
    Reg
    0.99
    REG
    0.97
     Riggs
    0.96
    ag
    0.96
     Reg
    0.95
     reg
    0.94
    Rég
    0.93
     REG
    0.91
    Ag
    0.89
    Act Density 0.483%

    No Known Activations