INDEX
    Explanations

    geographical names and locations

    New Auto-Interp
    Negative Logits
    ZF
    -0.17
    akis
    -0.17
    roti
    -0.15
     ди
    -0.15
    -runtime
    -0.15
     пÑĥ
    -0.14
     Tulsa
    -0.14
    kara
    -0.14
    ariat
    -0.14
    iou
    -0.14
    POSITIVE LOGITS
     Madison
    0.26
     Dane
    0.24
     Wis
    0.20
     Bad
    0.20
    Mad
    0.19
    Wis
    0.18
    mad
    0.18
     Marathon
    0.17
    Bad
    0.17
     Wisconsin
    0.17
    Act Density 0.025%

    No Known Activations