INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     ujednoznacz
    -0.75
     aapt
    -0.60
     HasFactory
    -0.59
     IBOutlet
    -0.57
     betweenstory
    -0.54
    ValueStyle
    -0.52
    elemField
    -0.52
     становника
    -0.50
    rungsseite
    -0.49
     Италијани
    -0.49
    POSITIVE LOGITS
    When
    0.73
     When
    0.69
    If
    0.59
    Whenever
    0.56
     Když
    0.55
     Quando
    0.54
    Quando
    0.54
     Когда
    0.54
     Lorsqu
    0.53
     If
    0.52
    Act Density 0.037%

    No Known Activations