INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     Coffee
    -0.08
     big
    -0.08
     really
    -0.07
     Alternative
    -0.07
     abolished
    -0.07
     Cognitive
    -0.07
     Farm
    -0.07
     primary
    -0.07
     low
    -0.06
     Kal
    -0.06
    POSITIVE LOGITS
     perfect
    0.09
     lined
    0.07
     استاد
    0.06
    ��
    0.06
    DH
    0.06
    avatel
    0.06
     İran
    0.06
     آموزش
    0.06
     Invoke
    0.06
     :)↵↵
    0.06
    Act Density 0.029%

    No Known Activations