INDEX
    Explanations

    proper nouns or names

    proper nouns related to organizations, places, and brands

    New Auto-Interp
    Negative Logits
    etheless
    -0.61
     separatist
    -0.60
     depreciation
    -0.55
     inherit
    -0.54
     plag
    -0.54
     dracon
    -0.53
     Rebels
    -0.53
     retaliate
    -0.53
     polarized
    -0.53
     sort
    -0.53
    POSITIVE LOGITS
    ona
    1.00
    oya
    0.88
    isha
    0.83
    inda
    0.81
    ley
    0.80
    ado
    0.79
    leys
    0.79
    onda
    0.79
    aci
    0.77
    ena
    0.77
    Act Density 0.457%

    No Known Activations