INDEX
    Explanations

    in phrases specifying parts

    NEGATIVE LOGITS
    0.64
    0.58
     beforehand  0.58
    '),  0.57
    0.55
     Kenyon  0.55
    itania  0.54
    0.53
     сейчас  0.52
     Mekong  0.52
    POSITIVE LOGITS
    不管是  0.74
     včetně  0.70
     incluindo  0.69
    すなわち  0.68
    including  0.68
     özellikle  0.67
     anzi  0.67
    जिसे  0.67
     INCLUDING  0.66
     включая  0.66
    Act Density 0.007%

    No Known Activations