INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    fyr
    -0.09
    žen
    -0.08
    _ver
    -0.08
    <Application
    -0.08
    יינער
    -0.08
    _req
    -0.08
    ייטער
    -0.08
     fogu
    -0.08
    Difficulty
    -0.08
     teško
    -0.08
    POSITIVE LOGITS
    0.08
     techniek
    0.07
    า�
    0.07
     dalla
    0.07
     remote
    0.07
     simultaneous
    0.07
     Mystic
    0.06
     simultaneously
    0.06
     salted
    0.06
    ıp
    0.06
    Act Density 0.003%

    No Known Activations