INDEX
    Explanations

    language that expresses appreciation or gratitude

    Negative Logits
    Token             Logit
    "Jährige"         -0.51
    "+#+#"            -0.49
    " referenties"    -0.47
    "expandindo"      -0.44
    "πάρχ"            -0.43
    "tet"             -0.43
    "venidos"         -0.42
    " bru"            -0.42
    " Gaye"           -0.42
    "ブル"            -0.41
    Positive Logits
    Token             Logit
    " winners"        0.99
    " winner"         0.86
    "Winners"         0.85
    " prize"          0.84
    " Winners"        0.84
    " prizes"         0.82
    " Prizes"         0.76
    " Prize"          0.75
    " Winner"         0.75
    "winners"         0.75
    Act Density 0.061%

    No Known Activations