INDEX
    Explanations

    focusing on concepts or actions

    New Auto-Interp
    Negative Logits
    и
    0.55
    z
    0.50
    v
    0.50
    0.49
    er
    0.48
    disease
    0.48
    cy
    0.48
    experienced
    0.48
    n
    0.48
    selling
    0.47
    POSITIVE LOGITS
     comentários
    0.50
     números
    0.46
     beban
    0.45
     sediment
    0.44
     sermon
    0.44
     poked
    0.43
    Kost
    0.43
     meditative
    0.43
     tesis
    0.42
     lom
    0.42
    Act Density 0.005%

    No Known Activations