INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    Collin
    0.43
    Siempre
    0.38
    Gew
    0.37
    0.37
     ощущение
    0.36
    είς
    0.36
    Nub
    0.36
    eric
    0.36
    }):
    0.36
     Nod
    0.36
    POSITIVE LOGITS
     capabilities
    1.33
     abilities
    1.29
     способности
    1.23
     kemampuan
    1.21
     ability
    1.20
    能力
    1.17
     capability
    1.15
    capabilities
    1.09
     क्षमताओं
    1.08
     prowess
    1.07
    Act Density 0.025%

    No Known Activations