INDEX
    Explanations

    quotations and punctuation marks

    New Auto-Interp
    Negative Logits
     giao
    -0.75
    [::-
    -0.71
    ories
    -0.70
    ory
    -0.70
     IEnumerable
    -0.69
     Fle
    -0.69
     Montal
    -0.68
     Gier
    -0.67
    IEnumerable
    -0.67
    ly
    -0.67
    POSITIVE LOGITS
    }}"
    1.14
    %"
    1.14
    ]"
    1.11
     }}"
    1.09
    }"
    1.07
    )"
    1.06
    ?"
    1.04
    ')"
    1.03
    ="#"
    1.02
    ">"
    1.01
    Act Density 0.112%

    No Known Activations