INDEX
    Explanations

    unsettling situations expressed with unease

    Negative Logits
    ర్జాతీయ         0.42
                    0.40
     Symmetric      0.40
    ネルギー        0.38
                    0.38
                    0.38
     Similarity     0.37
                    0.37
    ్యాన్ని         0.37
    ීම්             0.37
    Positive Logits
     pleased        0.49
     bothered       0.49
     perplexed      0.47
     puzzled        0.46
     perturbed      0.45
     disgusted      0.45
     startled       0.44
    azed            0.43
                    0.43
     annoyed        0.43
    Act Density 0.000%

    No Known Activations