INDEX
    Explanations

    conjunctive adverb or chain of thought

    New Auto-Interp
    Negative Logits
    िकुलम
    0.54
    itrile
    0.52
    0.50
     İstifadə
    0.48
    cupertino
    0.47
     করিয়াছিলেন
    0.47
     tasmim
    0.47
    0.47
    raj
    0.46
    Loksatta
    0.46
    POSITIVE LOGITS
     threads
    0.61
     slow
    0.53
     coaches
    0.52
     sinks
    0.51
     pillars
    0.50
     people
    0.49
     funds
    0.49
     pitches
    0.48
     colors
    0.48
     thread
    0.48
    Act Density 0.000%

    No Known Activations