INDEX
    Explanations

    phrases indicating spatial relationships, particularly those involving "inside."

    New Auto-Interp
    Negative Logits
    ventional
    -0.70
    HasKey
    -0.69
    REEK
    -0.67
     Sera
    -0.66
    лару
    -0.66
    apunov
    -0.66
    ambung
    -0.65
     Mawr
    -0.65
    cillor
    -0.64
    pulumi
    -0.64
    POSITIVE LOGITS
     Inside
    1.61
    Inside
    1.52
     INSIDE
    1.50
    inside
    1.45
     inside
    1.42
    INSIDE
    1.38
     InputDecoration
    1.32
     Dentro
    1.07
     Outside
    1.03
    OUTSIDE
    1.02
    Act Density 0.065%

    No Known Activations