INDEX
    Explanations

    expressions related to assumptions or conclusions

    New Auto-Interp
    Negative Logits
    DockStyle
    -0.75
    RegressionTest
    -0.74
     queſta
    -0.70
    HomeAsUpEnabled
    -0.67
    ftagPool
    -0.63
    TagMode
    -0.61
    StructEnd
    -0.60
    StreetMap
    -0.60
    UnknownFieldSet
    -0.59
     stockbild
    -0.57
    POSITIVE LOGITS
     be
    2.25
     быть
    1.06
     been
    1.01
     være
    1.00
     być
    0.99
     essere
    0.94
     être
    0.93
     have
    0.91
     belong
    0.90
     být
    0.90
    Act Density 1.258%

    No Known Activations