INDEX
    Explanations

    key nouns and certain prefixes or suffixes that indicate specific categories or attributes

    New Auto-Interp
    Negative Logits
    ocos
    -0.21
    èĤ²
    -0.19
     nÄĥ
    -0.19
     Gram
    -0.17
    aign
    -0.15
    andelier
    -0.15
     gram
    -0.15
    ousel
    -0.15
    Gram
    -0.15
    zend
    -0.15
    POSITIVE LOGITS
     ta
    0.20
    ta
    0.18
    Ta
    0.18
     Conc
    0.18
     Ta
    0.17
     Archer
    0.17
     Tanner
    0.16
     Shortcut
    0.16
     conc
    0.15
    -Ta
    0.15
    Act Density 0.019%

    No Known Activations