INDEX
    Explanations

    mentions of aid or support, particularly in the context of communities and healthcare

    Negative Logits
    Token       Logit
    "rik"       -0.15
    ".flink"    -0.14
    "hed"       -0.14
    ">tag"      -0.14
    "交"        -0.14
    "quet"      -0.14
    "mana"      -0.14
    "alus"      -0.13
    "ushima"    -0.13
    "crest"     -0.13
    Positive Logits
    Token          Logit
    " fit"         0.18
    " tailor"      0.17
    " account"     0.17
    " Tail"        0.17
    " dedicate"    0.16
    "عداد"         0.16
    "fits"         0.16
    " rises"       0.16
    " rise"        0.16
    " train"       0.16
    Activation Density: 0.009%

    No Known Activations