INDEX
    Explanations

    names, specifically those starting with "Dan."

    New Auto-Interp
    Negative Logits
     overs
    -0.61
     Trinidad
    -0.57
     instrument
    -0.57
    ĵĺ
    -0.55
     ECB
    -0.55
     footing
    -0.54
     rs
    -0.54
     disson
    -0.54
     violet
    -0.53
    ãģį
    -0.53
    POSITIVE LOGITS
    riot
    0.98
    nery
    0.97
    ulas
    0.97
    roman
    0.95
    riots
    0.92
    afort
    0.92
    esian
    0.91
    zl
    0.91
    atro
    0.90
    roach
    0.90
    Act Density 0.814%

    No Known Activations