INDEX
    Explanations

    references to licenses and legal information within text

    New Auto-Interp
    Negative Logits
     l
    -1.08
    l
    -0.90
    ll
    -0.65
     la
    -0.64
     le
    -0.64
    la
    -0.60
    -0.60
    le
    -0.60
    -0.59
     ll
    -0.58
    POSITIVE LOGITS
     Л
    1.05
     Lo
    1.01
     LL
    1.00
     L
    0.97
     LC
    0.95
     Ли
    0.93
     LI
    0.93
     LR
    0.91
     Li
    0.90
     LF
    0.89
    Act Density 1.309%

    No Known Activations