INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    忽略
    0.50
     epistem
    0.46
     recommended
    0.44
    推奨
    0.44
    0.44
     deceptive
    0.44
     toric
    0.44
     totem
    0.42
     "
    0.42
     পায়নি
    0.42
    POSITIVE LOGITS
     rumours
    0.73
     rumour
    0.70
     rumors
    0.66
     rumoured
    0.61
     rumor
    0.60
    0.59
     rumores
    0.54
     अफवाह
    0.53
    0.51
    Rum
    0.49
    Act Density 0.018%

    No Known Activations