INDEX
    Explanations

    statements or claims regarding facts

    New Auto-Interp
    Negative Logits
    IFT
    -0.15
    ÄĻk
    -0.15
    ivalent
    -0.15
    é£
    -0.15
    entiful
    -0.15
    nothrow
    -0.14
    ayacak
    -0.14
    _RESOURCES
    -0.14
    äºľ
    -0.14
    akra
    -0.13
    POSITIVE LOGITS
     even
    0.18
    934
    0.15
     sogar
    0.15
    ored
    0.15
     arguably
    0.15
    661
    0.14
    881
    0.14
    749
    0.14
    674
    0.14
    ually
    0.14
    Act Density 0.031%

    No Known Activations