INDEX
    Explanations

    instances of communication and expressions of opinion

    New Auto-Interp
    Negative Logits
    These
    -0.94
     These
    -0.86
     Estos
    -0.69
    these
    -0.69
    This
    -0.67
     Estas
    -0.66
    Estas
    -0.66
    Estos
    -0.60
    Its
    -0.60
     estas
    -0.58
    POSITIVE LOGITS
    StructEnd
    0.82
    Personendaten
    0.77
     THAT
    0.75
     eso
    0.72
    Sucesor
    0.70
     dat
    0.67
    kloped
    0.66
     Eso
    0.66
    それも
    0.66
    นั้น
    0.63
    Act Density 0.153%

    No Known Activations