INDEX
    Explanations

    phrases indicating classification or evaluation of objects and concepts

    New Auto-Interp
    Negative Logits
    ipes
    -0.15
    ppe
    -0.15
    abyrin
    -0.15
    tem
    -0.15
    redits
    -0.15
    iferay
    -0.14
    adows
    -0.14
    utoff
    -0.13
    inds
    -0.13
    agon
    -0.13
    POSITIVE LOGITS
    azzi
    0.16
    enance
    0.15
    YL
    0.14
     Provid
    0.14
    éϵ
    0.14
    μία
    0.14
     Barbar
    0.13
    quals
    0.13
    Fair
    0.13
    elian
    0.13
    Act Density 0.050%

    No Known Activations