INDEX
    Explanations

    references to identical entities or concepts in various contexts

    "same" followed by referring noun

    Negative Logits
    Token          Logit
    "ctivité"      -0.41
    "Spann"        -0.39
    "Shel"         -0.36
    "<?"           -0.36
    "ßt"           -0.34
    "Cant"         -0.34
    "ERGY"         -0.33
    "Psalms"       -0.32
    " тебе"        -0.32
    "preneurs"     -0.31
    Positive Logits
    Token          Logit
    " same"        0.93
    " desselben"   0.80
    "same"         0.78
    " mesmas"      0.75
    " dasselbe"    0.71
    " selben"      0.71
    " medesimo"    0.71
    " gleiche"     0.69
    " dieselbe"    0.69
    " derselben"   0.67
    Act Density 0.024%

    No Known Activations