    Explanations

    references to family, size, and grouping in various contexts

    Negative Logits
    Token           Logit
    " misi"         -0.38
    ".*")]"         -0.37
    " witnesses"    -0.36
    "NewLabel"      -0.35
    " expired"      -0.34
    "astify"        -0.34
    "addGap"        -0.34
    "pfung"         -0.33
    "anoma"         -0.33
    "🧼"            -0.33
    Positive Logits
    Token           Logit
    "large"         1.05
    "Large"         1.05
    " large"        1.02
    " big"          0.96
    " Large"        0.94
    "LARGE"         0.93
    "big"           0.91
    " LARGE"        0.88
    " großen"       0.84
    "Larger"        0.84
    Activation Density: 0.078%

    No Known Activations