INDEX
    Explanations

    references to numbers and their significance in various contexts

    New Auto-Interp
    Negative Logits
    ing
    -0.16
    ве
    -0.15
    rop
    -0.15
    war
    -0.15
    ings
    -0.15
    ITTE
    -0.14
    ViewHolder
    -0.14
    seed
    -0.14
    idelberg
    -0.14
    er
    -0.14
    POSITIVE LOGITS
    UpDown
    0.20
    eral
    0.20
    osity
    0.18
    ismatic
    0.18
    ical
    0.17
    ICAL
    0.17
    érique
    0.17
     Bris
    0.17
    rical
    0.15
    ERO
    0.15
    Act Density 0.015%

    No Known Activations