INDEX
    Explanations

    locations and their surroundings

    New Auto-Interp
    Negative Logits
    cektir
    -0.60
    -0.51
    ESTON
    -0.49
     occupy
    -0.47
    hu
    -0.46
    adin
    -0.46
     Ad
    -0.45
     لدي
    -0.44
     have
    -0.43
     %
    -0.43
    POSITIVE LOGITS
     pleaſure
    0.82
    tagHelperRunner
    0.79
     myſelf
    0.78
    ſelf
    0.76
     houſe
    0.76
     виправивши
    0.76
     Majefty
    0.75
     poffible
    0.75
     Jefus
    0.75
    abestanden
    0.75
    Act Density 0.603%

    No Known Activations