INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     ارائه
    -0.07
     erh
    -0.07
     نش
    -0.06
    -0.06
    	char
    -0.06
    ,True
    -0.06
    ,char
    -0.06
    @"
    -0.06
     надання
    -0.06
    Sampler
    -0.06
    POSITIVE LOGITS
    	Intent
    0.07
     spacer
    0.07
    сер
    0.07
     Strikes
    0.06
    .Point
    0.06
     Ago
    0.06
     drug
    0.06
     rating
    0.06
     Coaching
    0.06
    <|end_header_id|>
    0.06
    Act Density 0.017%

    No Known Activations