INDEX
    Explanations

    Scandinavian and Hungarian encouragement

    New Auto-Interp
    Negative Logits
     peur
    0.39
     প্রত্যাখ্যান
    0.39
    0.39
    ),]
    0.38
     ಸಮಸ್ಯ
    0.38
    릿
    0.38
    ']))->
    0.38
     //}
    0.38
    insertOne
    0.37
     விசாரணை
    0.37
    POSITIVE LOGITS
     imponer
    0.62
     impon
    0.59
     bege
    0.57
     virkelig
    0.50
     begeistert
    0.49
     entusias
    0.49
     niezwy
    0.48
     മിക
    0.47
     uncompromising
    0.46
     mers
    0.46
    Act Density 0.001%

    No Known Activations