INDEX
    Explanations

    temperature

    New Auto-Interp
    Negative Logits
     жизнь
    -0.07
    Aware
    -0.07
    _viewer
    -0.07
    reveal
    -0.06
     Geile
    -0.06
    886
    -0.06
    	strncpy
    -0.06
     birçok
    -0.06
    .Focused
    -0.06
    ляться
    -0.06
    POSITIVE LOGITS
     smoothing
    0.07
     zab
    0.07
    umm
    0.06
    plain
    0.06
    0.06
    PCI
    0.06
    sch
    0.06
    ]=]
    0.06
     fund
    0.06
     backdrop
    0.06
    Act Density 0.010%

    No Known Activations