INDEX
    Explanations

    adverbs or intensifiers

    New Auto-Interp
    Negative Logits
     he
    0.41
     they
    0.41
     whose
    0.39
     where
    0.39
    nero
    0.39
     particulares
    0.39
    百姓
    0.39
    liance
    0.37
     Offering
    0.36
    മ്പ്
    0.36
    POSITIVE LOGITS
    водится
    0.45
    Especially
    0.43
    رحله
    0.43
     resonates
    0.42
    رحب
    0.42
    much
    0.42
     adheres
    0.42
     terutama
    0.42
     solely
    0.41
     especialmente
    0.41
    Act Density 0.003%

    No Known Activations