INDEX
    Explanations

    drawing analogies with like/as

    New Auto-Interp
    Negative Logits
    일까지
    0.83
    역시
    0.79
     మాత్రం
    0.79
     까지
    0.77
     حتی
    0.77
    even
    0.77
     Even
    0.76
     nawet
    0.76
     вовсе
    0.75
     even
    0.73
    POSITIVE LOGITS
     headlights
    0.83
     upscale
    0.81
     highlighter
    0.81
     appetizers
    0.79
     speedometer
    0.78
     someone
    0.77
     somebody
    0.77
     quelqu
    0.76
     punya
    0.76
     catcher
    0.75
    Act Density 0.482%

    No Known Activations