INDEX
    Explanations

    phrases indicating importance or emphasis

    New Auto-Interp
    Negative Logits
     préfère
    -0.61
     déclare
    -0.57
    Produzione
    -0.57
    Oster
    -0.49
     prouve
    -0.49
     considère
    -0.49
     défend
    -0.47
    Πηγές
    -0.47
     prends
    -0.47
     lega
    -0.46
    POSITIVE LOGITS
     note
    1.02
    noted
    1.01
     notes
    1.01
     noted
    0.99
     noting
    0.97
    note
    0.89
    notes
    0.84
     Note
    0.84
     Notes
    0.83
     NOTES
    0.82
    Act Density 0.050%

    No Known Activations