    Negative Logits
    Token                 Logit
    "我が家の"            -0.55
    "ConstraintMaker"     -0.51
    "avax"                -0.50
    " alcuna"             -0.49
    "laught"              -0.49
    " GenerationType"     -0.48
    " trasparente"        -0.47
    "freies"              -0.47
    " Ignite"             -0.47
    "ワイイ"              -0.46
    Positive Logits
    Token                 Logit
    "People"              1.57
    "people"              1.55
    " people"             1.52
    " People"             1.50
    " PEOPLE"             1.46
    "PEOPLE"              1.41
    " ppl"                1.02
    " peop"               0.96
    " peoples"            0.90
    " Menschen"           0.90
    Activation Density: 0.099%

    No Known Activations