INDEX
    Explanations

    words related to physical structures and their condition or status

    Negative Logits
    ixel              -0.15
    Carthy            -0.14
     cramped          -0.14
     opak             -0.14
    订                 -0.14
     AssemblyTitle    -0.13
    avax              -0.13
    ATRIX             -0.13
     Ledger           -0.13
    249               -0.13
    Positive Logits
     abandoned        0.58
     abandonment      0.53
     abandon          0.53
     deserted         0.38
     fors             0.35
     decay            0.35
    andoned           0.35
     abandoning       0.35
     dil              0.31
    弃                 0.31
    Act Density 0.203%

    No Known Activations