INDEX
    Explanations

    terms related to military involvement and identification of specific entities

    after common words ("can," "the", "to", or "of")

    tax and then its impact

    New Auto-Interp
    Negative Logits
     amizade
    -0.47
     đình
    -0.43
    ждане
    -0.43
     preguntó
    -0.42
     sorriso
    -0.42
     pergunt
    -0.41
    aktery
    -0.41
    などを
    -0.41
     maioria
    -0.41
    zepte
    -0.41
    POSITIVE LOGITS
     Erişim
    0.74
    TestingModule
    0.73
    AddTagHelper
    0.71
     صوتيه
    0.69
     OMITBAD
    0.67
     متعلقه
    0.66
     ]
    
    0.65
    tonode
    0.65
     lenker
    0.65
    "")
    0.64
    Act Density 0.497%

    No Known Activations