INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     themſelves
    -0.89
    Искәрмәләр
    -0.83
    Билгалдахарш
    -0.80
     Saxons
    -0.76
     Nö
    -0.74
     ſeveral
    -0.74
    oporosis
    -0.73
    ]")]
    -0.73
     kamb
    -0.73
     Huguen
    -0.73
    POSITIVE LOGITS
     City
    1.35
     city
    1.35
     cities
    1.34
     CITY
    1.26
     Cities
    1.22
    city
    1.19
    cities
    1.17
     getCity
    1.16
    Cities
    1.15
    City
    1.14
    Act Density 0.049%

    No Known Activations