INDEX
    Explanations

    phrases that assert or question knowledge or claims

    New Auto-Interp
    Negative Logits
    NameInMap
    -0.59
    kloped
    -0.55
    Higgins
    -0.53
     Erzb
    -0.50
    displayquote
    -0.47
     caller
    -0.46
    SpringBootTest
    -0.45
    uhi
    -0.45
    )(((
    -0.45
    Picking
    -0.45
    POSITIVE LOGITS
     Waray
    0.78
     Numerade
    0.75
     cref
    0.72
    VersionUID
    0.70
     متعلقه
    0.68
     AssemblyCompany
    0.66
    ритори
    0.66
     umana
    0.65
     betweenstory
    0.65
    ümüş
    0.63
    Act Density 0.005%

    No Known Activations