    Explanations

    is a, is based, is designed

    Negative Logits
    不錯  0.28
    重要  0.21
     γιατί  0.21
    很重要  0.21
    不错  0.21
     craziness  0.20
     annan  0.20
     중요  0.20
    或者  0.20
     incorporación  0.20
    Positive Logits
     characterized  0.51
     designed  0.48
     characterised  0.45
     comprised  0.44
     able  0.44
     composed  0.43
     fundamentally  0.40
     based  0.39
     imbued  0.39
     replete  0.37
    Activation Density 0.270%

    No Known Activations