INDEX
    Negative Logits
    Token          Logit
    " Haitian"     -0.65
    "ACTED"        -0.64
    " Romanian"    -0.64
    " Haram"       -0.62
    " Scotia"      -0.62
    " concess"     -0.60
    "_-"           -0.59
    " âĸĪ"         -0.59
    " Croatian"    -0.58
    " SCP"         -0.58
    Positive Logits
    Token          Logit
    "acca"         0.91
    "alk"          0.76
    "abe"          0.76
    "agos"         0.76
    "aney"         0.75
    "ynski"        0.75
    "ensen"        0.74
    "onson"        0.73
    "rouse"        0.73
    "iege"         0.71
    Activation Density: 0.067%

    No Known Activations