INDEX
    Explanations

    names or terms related to Japanese culture or individuals

    New Auto-Interp
    Negative Logits
    tering
    -0.85
    matically
    -0.73
    sheet
    -0.71
    laughter
    -0.71
    ulence
    -0.70
    trace
    -0.70
    iaries
    -0.70
    drivers
    -0.69
    undy
    -0.67
    rodu
    -0.66
    POSITIVE LOGITS
    ichi
    1.39
     Tsuk
    1.34
     Yosh
    1.32
    oka
    1.31
     Tanaka
    1.27
    ishi
    1.26
     Nish
    1.24
     Mats
    1.24
     Tsu
    1.21
    hiro
    1.21
    Act Density 0.136%

    No Known Activations