INDEX
    Explanations

    expressions related to making choices or decisions

    New Auto-Interp
    Negative Logits
     houſe
    -0.74
     Monfieur
    -0.72
     poffible
    -0.71
     ſche
    -0.69
     Conſ
    -0.66
     paff
    -0.64
     ſeveral
    -0.63
     ")");
    -0.63
    AndEndTag
    -0.62
    /*
    -0.60
    POSITIVE LOGITS
     decided
    0.85
     instead
    0.82
     rather
    0.77
     opted
    0.76
     décidé
    0.76
     решила
    0.76
     opting
    0.75
     решили
    0.71
     chose
    0.71
     choose
    0.70
    Act Density 0.269%

    No Known Activations