INDEX
    Explanations

    phrases indicating requirements or necessities

    New Auto-Interp
    Negative Logits
    etros
    -0.14
    mdb
    -0.14
    ylko
    -0.13
    uarios
    -0.13
    romium
    -0.13
    ınca
    -0.13
    ancock
    -0.13
    rypton
    -0.13
    ekyll
    -0.13
     rá»Ļng
    -0.13
    POSITIVE LOGITS
     exactly
    0.23
     precisely
    0.21
     právÄĽ
    0.20
     именно
    0.20
     perfectly
    0.18
     literal
    0.17
     Exactly
    0.17
     Ñģаме
    0.17
     totiž
    0.16
     literally
    0.16
    Act Density 0.079%

    No Known Activations