INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    UNE
    -0.07
     coloring
    -0.07
     Development
    -0.07
     CURRENT
    -0.06
     Developing
    -0.06
     граф
    -0.06
    -speed
    -0.06
     więcej
    -0.06
    ,'
    -0.06
     caller
    -0.06
    POSITIVE LOGITS
    .Aggressive
    0.07
    afx
    0.06
    99
    0.06
    296
    0.06
    Кон
    0.06
    ):
    ↵
    ↵
    0.06
    .token
    0.06
    -hearted
    0.06
    _markers
    0.06
    _axes
    0.06
    Act Density 0.000%

    No Known Activations