INDEX
    Explanations

    references to the United States

    New Auto-Interp
    Negative Logits
     itſelf
    -0.60
     atof
    -0.56
     فريبيس
    -0.53
    दा
    -0.53
    ChildScrollView
    -0.52
     whoſe
    -0.49
     LXXX
    -0.49
     Theſe
    -0.48
     Weich
    -0.47
    روا
    -0.46
    POSITIVE LOGITS
     ویکی‌پدیا
    0.68
    Климат
    0.54
     vlo
    0.52
     >=",
    0.50
     distanciation
    0.50
     Coast
    0.48
    unesse
    0.47
    Autoritní
    0.47
     GreatSchools
    0.47
     بيها
    0.46
    Act Density 0.316%

    No Known Activations