INDEX
    Explanations

    names of countries or regions

    New Auto-Interp
    Negative Logits
    WATCH
    -0.74
     mechanically
    -0.72
     scratch
    -0.69
    PLAY
    -0.67
    FFER
    -0.66
     deflation
    -0.65
     curing
    -0.65
     crunch
    -0.64
    FIX
    -0.64
     underdog
    -0.64
    POSITIVE LOGITS
    aii
    1.05
    ensis
    1.01
    ui
    0.97
    ai
    0.94
    phia
    0.93
    anu
    0.92
    oru
    0.90
    Äģ
    0.89
    uni
    0.88
    ku
    0.88
    Act Density 0.261%

    No Known Activations