INDEX
    Explanations

    references to gratitude and acknowledgment of individuals or groups

    New Auto-Interp
    Negative Logits
    cheid
    -0.17
    kowski
    -0.15
    lector
    -0.15
    vig
    -0.15
    _Context
    -0.15
    eba
    -0.14
    oblin
    -0.14
    imler
    -0.14
    ighth
    -0.14
    nten
    -0.14
    POSITIVE LOGITS
    845
    0.20
    äºķ
    0.16
    ÙĦس
    0.15
    ÏģÎŃ
    0.15
    DET
    0.15
    963
    0.14
     зав
    0.14
    à¤Ĥà¤ĸ
    0.14
     Tactics
    0.14
    ิว
    0.14
    Act Density 0.025%

    No Known Activations