INDEX
    Explanations

    various languages express "said"

    Negative Logits
     PETER  0.46
     FRO  0.44
     FRANCIS  0.43
     AFTER  0.43
     FROM  0.42
     JOSEPH  0.42
     WITH  0.41
     VIC  0.41
     www  0.40
     ELECTRON  0.40
    Positive Logits
    操作  0.44
     말했다  0.43
     esprim  0.42
     şöyle  0.42
    expression  0.42
     berkata  0.41
     व्यक्त  0.41
     expression  0.40
    表达  0.40
    ロボ  0.40
    Activation Density: 0.001%

    No Known Activations