INDEX
    Explanations

    phrases related to authority and control

    New Auto-Interp
    Negative Logits
    ſelf
    -0.75
    SpringBootTest
    -0.63
     ſtate
    -0.60
     ſta
    -0.59
    (__('
    -0.58
     ſche
    -0.57
     {*}
    -0.57
     pleaſure
    -0.57
     faſt
    -0.56
     ujednoznacz
    -0.55
    POSITIVE LOGITS
     meille
    0.45
     ilman
    0.43
     vermelhas
    0.43
     Tiefen
    0.42
     Encuentra
    0.42
     väh
    0.42
     것은
    0.39
     เต
    0.39
     eikä
    0.39
     daž
    0.39
    Act Density 0.041%

    No Known Activations