INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    terness
    -0.81
    itiveness
    -0.75
     quotes
    -0.71
    76561
    -0.69
    owe
    -0.68
     attributes
    -0.65
    ynes
    -0.63
    gru
    -0.62
    endum
    -0.61
    peed
    -0.60
    POSITIVE LOGITS
     Guam
    0.93
     Taiwan
    0.87
     Azerb
    0.85
     Marshall
    0.80
     Greenland
    0.78
     Samoa
    0.77
    Tai
    0.77
     Philippines
    0.76
     Kazakhstan
    0.76
     Archangel
    0.75
    Act Density 0.073%

    No Known Activations