INDEX
    Explanations

    phrases related to statistical bias in measurements

    New Auto-Interp
    Negative Logits
    &apos
    -0.21
     
    -0.17
    ?"↵↵↵↵
    -0.15
    [^
    -0.15
    ůvod
    -0.14
    одо
    -0.14
    ↵
    -0.14
      
    -0.14
     ↵↵
    -0.14
        
    -0.13
    POSITIVE LOGITS
     Â
    0.37
    ÂĶ
    0.36
    Âĵ
    0.32
    Âħ
    0.26
     Falk
    0.25
    Âĸ
    0.24
    Âij
    0.23
     Argentine
    0.23
    ÂĴ
    0.22
     Ur
    0.21
    Act Density 0.005%

    No Known Activations