INDEX
    Explanations

    mentions of statistical data or trends

    New Auto-Interp
    Negative Logits
    <bos>
    -0.83
    relenting
    -0.70
    mistak
    -0.63
    chunky
    -0.60
    wavering
    -0.59
    classy
    -0.56
    kawaii
    -0.55
     peines
    -0.55
    spania
    -0.55
    snowy
    -0.54
    POSITIVE LOGITS
     Bekasi
    0.69
     azule
    0.67
     granada
    0.65
     Trá
    0.64
     magis
    0.64
     hcm
    0.63
     Almería
    0.62
     aen
    0.61
     Praça
    0.61
     tamen
    0.61
    Act Density 0.345%

    No Known Activations