INDEX
    Explanations
    No Explanations Found
    New Auto-Interp
    Negative Logits
     it
    0.74
     olev
    0.74
     okaz
    0.71
     comprende
    0.67
     stari
    0.67
     прав
    0.66
     sağlam
    0.65
     mitä
    0.65
     actitudes
    0.65
     জেট
    0.65
    POSITIVE LOGITS
    0.85
    տ
    0.84
    คำ
    0.76
    0.74
     Tahiti
    0.70
    ,\
    0.69
    ),'
    0.67
    일에
    0.67
    вання
    0.66
    ORIAL
    0.66
    Act Density 0.006%

    No Known Activations