    Negative Logits
    Token         Logit
    "	vertex"    -0.08
    " Customs"    -0.07
    " fanc"       -0.07
    ")},↵"        -0.07
    "cken"        -0.07
    " суще"       -0.07
    " UR"         -0.06
    " UT"         -0.06
    " ч"          -0.06
    "_HTTP"       -0.06

    Positive Logits
    Token         Logit
    " goals"      0.20
    " goal"       0.15
    " Goals"      0.13
    "Goals"       0.13
    "Goal"        0.10
    "goal"        0.10
    "goals"       0.10
    "-goal"       0.10
    " Goal"       0.10
    "_goal"       0.08

    Activation Density: 0.021%

    No Known Activations