INDEX
    Explanations

    crucial / essential / important

    New Auto-Interp
    Negative Logits
    してみました
    0.40
     authorized
    0.38
     interesting
    0.38
     admired
    0.38
     authorised
    0.38
    浪漫
    0.38
     interessant
    0.37
    ロッパ
    0.36
    orette
    0.36
    ಿಸಬಹುದು
    0.36
    POSITIVE LOGITS
     crucial
    2.06
     중요
    1.73
     중요하다
    1.65
     vital
    1.62
     essential
    1.57
     penting
    1.53
    很重要
    1.53
    重要
    1.52
     важно
    1.52
     필수
    1.48
    Act Density 0.055%

    No Known Activations