INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     family
    -1.95
    family
    -1.88
    Family
    -1.82
     Family
    -1.80
     FAMILY
    -1.69
    FAMILY
    -1.45
     famille
    -1.35
     família
    -1.25
     familiale
    -1.23
     keluarga
    -1.18
    POSITIVE LOGITS
    )";
    
    0.62
     member
    0.59
     of
    0.58
    })`
    0.58
     members
    0.56
    )))
    
    0.52
    ")))
    0.52
    )"),
    0.51
    ))).
    0.48
    )*/
    0.48
    Act Density 0.081%

    No Known Activations