    NEGATIVE LOGITS
     Kharkiv      0.79
     Dortmund     0.78
     Харків       0.76
     train        0.73
    Train         0.73
     Shale        0.72
    🚄            0.72
     المصري       0.72
                  0.71
     ওয়ালপেপার    0.70
    POSITIVE LOGITS
     Islands      1.85
     island       1.77
     islands      1.77
     Island       1.66
     Islanders    1.54
    Island        1.52
     islas        1.51
     archipelago  1.50
    island        1.48
     Caribbean    1.47
    Activation Density: 0.150%

    No Known Activations