INDEX
    Explanations

    signals intent or meaning

    New Auto-Interp
    Negative Logits
    Signs
    0.47
     plaques
    0.45
     Signs
    0.45
     stamps
    0.44
     símbolos
    0.43
     signs
    0.42
     begeistert
    0.40
     يجعل
    0.40
     симпто
    0.40
     morphologies
    0.39
    POSITIVE LOGITS
     intentions
    0.64
     принадле
    0.62
     impending
    0.58
    Intent
    0.57
     belonging
    0.56
    确实
    0.55
     presence
    0.54
     indicating
    0.53
     intend
    0.53
     denoting
    0.52
    Act Density 0.043%

    No Known Activations