    Explanations

    expressions of honesty or frankness in opinions

    Negative Logits
    Token             Logit
    "aguya"           -0.71
    "GEBURTS"         -0.65
    "ódź"             -0.64
    "ouncil"          -0.63
    " pach"           -0.62
    "americas"        -0.61
    ":✨"              -0.60
    "lemmer"          -0.60
    "=’"              -0.59
    "archiviato"      -0.59
    Positive Logits
    Token             Logit
    " honestly"       1.00
    " frankly"        0.97
    "Honestly"        0.94
    "Frankly"         0.93
    " Honestly"       0.90
    "honestly"        0.84
    " tbh"            0.70
    " honest"         0.70
    "ScopeManager"    0.69
    " honn"           0.68
    Activation Density: 0.099%

    No Known Activations