INDEX
    Explanations

    various forms of comparison and resemblance in descriptions

    New Auto-Interp
    Negative Logits
    httphttps
    -0.61
    fazer
    -0.50
     keduanya
    -0.49
     critères
    -0.48
     casada
    -0.48
     bezpiecze
    -0.47
     Herrn
    -0.47
     Geräten
    -0.46
     bestaan
    -0.45
     którzy
    -0.45
    POSITIVE LOGITS
     akin
    0.50
     Савезне
    0.46
     like
    0.46
     a
    0.44
     pseudo
    0.42
     bl
    0.42
     ers
    0.42
     jud
    0.41
     quasi
    0.41
     modern
    0.41
    Act Density 0.452%

    No Known Activations