INDEX
    Explanations

    words related to classification or categories

    instances of the word "class" and its variations in different contexts

    New Auto-Interp
    Negative Logits
    hiba
    -0.68
    aukee
    -0.64
    OPLE
    -0.62
    obin
    -0.62
    foreseen
    -0.61
     stump
    -0.61
    vernment
    -0.61
    orthy
    -0.61
    BAT
    -0.58
    vind
    -0.57
    POSITIVE LOGITS
    ifications
    1.40
    ifier
    1.31
    ifiers
    1.29
    ifying
    1.26
    ifies
    1.21
    ifiable
    1.16
    ified
    1.12
    ification
    1.06
    ically
    1.06
    room
    1.05
    Act Density 0.042%

    No Known Activations