INDEX
    Explanations

    expressions of knowledge and understanding

    New Auto-Interp
    Negative Logits
     McCart
    -0.17
    emailer
    -0.17
    mina
    -0.17
    иÑĤом
    -0.14
     дÑĥ
    -0.14
    rem
    -0.14
     trous
    -0.14
    ÂŃi
    -0.14
    ais
    -0.13
    omers
    -0.13
    POSITIVE LOGITS
    980
    0.15
    462
    0.14
    IDGE
    0.14
    agar
    0.14
    810
    0.14
     Attached
    0.14
    564
    0.14
    anje
    0.14
    918
    0.13
     alternate
    0.13
    Act Density 0.056%

    No Known Activations