INDEX
    Explanations

    actions or suggestions that involve "taking" in various contexts

    New Auto-Interp
    Negative Logits
    務省
    -0.71
     Miser
    -0.68
     Horner
    -0.65
    partiet
    -0.65
    Miser
    -0.64
    landet
    -0.63
     errno
    -0.63
    irot
    -0.63
    ynthia
    -0.63
     Granger
    -0.63
    POSITIVE LOGITS
    Taking
    1.57
     Taking
    1.49
    take
    1.49
     taking
    1.43
     take
    1.41
    Take
    1.40
     taken
    1.40
    taking
    1.40
    took
    1.37
    taken
    1.33
    Act Density 0.165%

    No Known Activations