INDEX
    Explanations

    concepts related to social issues and personal values

    New Auto-Interp
    Negative Logits
    apan
    -0.48
    enumi
    -0.45
    Ni
    -0.44
    Nicole
    -0.43
    Met
    -0.42
    Ge
    -0.42
    Me
    -0.42
    ayangkan
    -0.42
    Pan
    -0.41
    Dr
    -0.40
    POSITIVE LOGITS
    OGND
    0.54
    UnitTesting
    0.50
    jspb
    0.50
     autorytatywna
    0.50
     ujednoznacz
    0.48
    Personendaten
    0.48
    =*/
    0.46
    Diweddarwch
    0.45
    ]")]
    0.44
     pinulongan
    0.43
    Act Density 0.259%

    No Known Activations