INDEX
    Explanations

    references to relationships and interactions between people

    "side" after certain prepositions or quantifiers

    New Auto-Interp
    Negative Logits
    ConstraintMaker
    -0.45
    をも
    -0.42
    avut
    -0.39
    ographe
    -0.38
    -0.37
    jstor
    -0.36
     بهم
    -0.36
    Separator
    -0.36
     Abstand
    -0.36
     Roumanie
    -0.35
    POSITIVE LOGITS
     side
    2.86
     sides
    2.55
     Side
    2.45
    Side
    2.36
    side
    2.34
     SIDE
    2.32
    SIDE
    2.23
    sides
    2.13
     Sides
    2.05
    Sides
    1.95
    Act Density 0.423%

    No Known Activations