    Negative Logits
    Token               Logit
    " betweenstory"     -0.73
    " surla"            -0.72
    " חיצוניים"         -0.71
    "GEBURTSDATUM"      -0.68
    "InvalidProtocol"   -0.68
    " وتسجيلات"         -0.67
    " незавершена"      -0.67
    " télécharge"       -0.66
    " BrowserModule"    -0.65
    "IntoConstraints"   -0.65
    Positive Logits
    Token               Logit
    " rest"             0.75
    " consider"         0.59
    " focus"            0.55
    " listen"           0.54
    " let"              0.54
    " examine"          0.53
    " read"             0.53
    " study"            0.52
    "rest"              0.52
    "Rest"              0.52
    Activation Density: 0.037%

    No Known Activations