INDEX
    Explanations

    phrases related to statements or declarations

    Negative Logits
    undown           -0.67
     CLR             -0.62
     Palestin        -0.61
     charisma        -0.60
     Kindle          -0.60
     Morse           -0.59
     paperback       -0.59
     multiplication  -0.59
     Rez             -0.59
     unbeliev        -0.58
    Positive Logits
    ï¸ı               0.96
    together          0.92
    agree             0.91
    selves            0.89
    yg                0.85
    east              0.83
    ··                0.81
    sure              0.81
    mand              0.80
    else              0.77
    Activation Density 0.131%

    No Known Activations