INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     maintenant
    -0.08
     Meadows
    -0.07
     facilitating
    -0.07
    enty
    -0.07
    -0.07
    ROUP
    -0.06
    _logical
    -0.06
    _reports
    -0.06
    -0.06
    -rise
    -0.06
    POSITIVE LOGITS
    :red
    0.06
     обс
    0.06
     phon
    0.06
     zel
    0.06
    ._
    0.06
     italian
    0.06
     xOffset
    0.06
     trimmed
    0.06
     shrugged
    0.06
    ΑΡ
    0.06
    Act Density 0.028%

    No Known Activations