INDEX
    Explanations

    instances of willingness and agreement in various contexts

    Negative Logits
    (',');
    -0.61
    "]));
    -0.61
    IndentedString
    -0.61
    "]);
    -0.59
    ")->
    -0.58
     />);
    -0.58
    AutoScaleMode
    -0.58
     ↓,
    -0.57
    urat
    -0.56
    ]));
    -0.55
    Positive Logits
     willingly
    0.89
     willing
    0.88
    willing
    0.86
     accept
    0.79
     accepting
    0.78
     aceitar
    0.77
    Willing
    0.76
     willingness
    0.76
     accepte
    0.76
     écoute
    0.75
    Act Density 0.235%

    No Known Activations