    Negative Logits
    Token         Logit
    "_BEGIN"      -0.07
    " شکن"        -0.06
    "-create"     -0.06
    "Sau"         -0.06
    "Whole"       -0.06
    (blank)       -0.06
    " Gould"      -0.06
    " aday"       -0.06
    "toy"         -0.06
    "alphabet"    -0.06
    Positive Logits
    Token         Logit
    " traffic"    0.14
    " Traffic"    0.12
    "traffic"     0.11
    "Traffic"     0.10
    "(^"          0.08
    " ніч"        0.08
    " дорож"      0.07
    (blank)       0.07
    "Ice"         0.07
    ",data"       0.06
    Activation Density: 0.007%

    No Known Activations