INDEX
    Explanations

    terms related to physical and emotional states or conditions

    New Auto-Interp
    Negative Logits
     alone
    -0.15
     par
    -0.14
    alone
    -0.14
    elin
    -0.13
    eling
    -0.13
    tring
    -0.13
    inish
    -0.13
     OK
    -0.13
    ases
    -0.13
    ase
    -0.13
    POSITIVE LOGITS
     real
    0.27
    -real
    0.26
    Real
    0.24
    (real
    0.23
     REAL
    0.23
     Real
    0.23
     羣
    0.22
    REAL
    0.22
    real
    0.22
    .Real
    0.22
    Act Density 0.132%

    No Known Activations