    Explanations

    references to oaths and vows, particularly in a context involving love and commitment

    Negative Logits
    å¡        -0.18
    rana      -0.14
    quier     -0.14
    itzer     -0.14
    ernel     -0.14
     Perr     -0.14
    witter    -0.13
    Ñĥз       -0.13
    ojis      -0.13
    oes       -0.13
    Positive Logits
     oath          0.27
     allegiance    0.25
     sworn         0.23
     swearing      0.19
     swore         0.19
     vows          0.18
     swear         0.18
     binds         0.18
     pled          0.17
     pledge        0.17
    Activation Density 0.032%

    No Known Activations