INDEX
    Explanations

    references to real-life events and situations

    New Auto-Interp
    Negative Logits
    istrovstvÃŃ
    -0.17
    xfb
    -0.15
    etxt
    -0.14
    iant
    -0.14
    iales
    -0.14
     YYS
    -0.14
    conds
    -0.14
    cept
    -0.14
    éric
    -0.14
    stvo
    -0.13
    POSITIVE LOGITS
     counterpart
    0.18
    imony
    0.16
    eco
    0.15
    gear
    0.15
     counterparts
    0.15
     bat
    0.14
     augment
    0.14
    like
    0.14
     occurrences
    0.14
    .opensource
    0.14
    Act Density 0.046%

    No Known Activations