INDEX
    Explanations

    references to alternatives or options in various contexts

    New Auto-Interp
    Negative Logits
      
    -0.56
    -0.45
    ,
    -0.44
     "
    -0.42
     '
    -0.41
     (
    -0.40
       
    -0.39
    -0.39
     Pog
    -0.39
    -0.38
    POSITIVE LOGITS
     alternatives
    1.61
     Alternatives
    1.59
    alternatives
    1.57
    Alternatives
    1.52
     alternativas
    1.13
     فريبيس
    0.91
     '\\;'
    0.89
     substitutes
    0.88
     autorytatywna
    0.84
    0.83
    Act Density 0.010%

    No Known Activations