INDEX
    Explanations

    references to the Philippines and its people

    Philippines and Filipino

    New Auto-Interp
    Negative Logits
    BagLayout
    -0.62
     surla
    -0.60
    ArgsConstructor
    -0.58
     الرياضيه
    -0.53
    UnitTesting
    -0.51
     Viana
    -0.50
    🦯
    -0.50
    ragalactic
    -0.50
    Décès
    -0.50
     isolado
    -0.49
    POSITIVE LOGITS
     Philippines
    1.08
    Philippines
    1.02
     Philippine
    0.92
    Philippine
    0.90
     Filipino
    0.82
     Filipinos
    0.80
     Filipina
    0.79
     Filipinas
    0.74
     philippines
    0.73
     philipp
    0.71
    Act Density 0.003%

    No Known Activations