INDEX
    Explanations

    sentences that express strong opinions or judgements

    Negative Logits
    exus             -0.16
     Angiospermae    -0.16
    stad             -0.15
    AXB              -0.14
    мÑĭ              -0.14
    fra              -0.14
    rastructure      -0.14
    æ¥ŃåĭĻ           -0.14
    lesh             -0.13
    klä              -0.13

    Positive Logits
    .JSON            0.15
    aly              0.14
    าà¸Ķ             0.14
    eil              0.14
    inox             0.13
    onga             0.13
     Blockly         0.13
    åºŃ              0.13
     main            0.13
    ubo              0.13
    Activation Density: 0.018%

    No Known Activations