INDEX
    Explanations

    phrases indicating disagreement or urging caution in discussions

    New Auto-Interp
    Negative Logits
    MethodManager
    -0.50
    bootstrapcdn
    -0.48
    enschappelijke
    -0.41
     Ressources
    -0.41
     SwitchCompat
    -0.41
    的呢
    -0.40
     bougies
    -0.40
    <()>
    -0.39
    Objective
    -0.39
     intStringLen
    -0.39
    POSITIVE LOGITS
     jangan
    0.51
     Jangan
    0.50
     Remember
    0.50
     đừng
    0.49
    Jangan
    0.48
     be
    0.48
     不要
    0.46
     don
    0.45
     Don
    0.45
     considérons
    0.44
    Act Density 0.190%

    No Known Activations