INDEX
    Explanations

    knowing and understanding

    New Auto-Interp
    Negative Logits
    真的是
    0.47
     진짜
    0.44
    有没有
    0.43
     skute
    0.43
     gerçekten
    0.42
    是否有
    0.42
    </h2>
    0.41
    真是
    0.41
     உண்மையில்
    0.40
    有沒有
    0.40
    POSITIVE LOGITS
     perfectamente
    0.76
     perfectly
    0.71
     perfettamente
    0.68
     parfaitement
    0.64
     intuitively
    0.61
     intellectually
    0.59
     instinctively
    0.59
     прекрасно
    0.56
     Perfectly
    0.56
     vaguely
    0.54
    Act Density 0.008%

    No Known Activations