INDEX
    Explanations

    words related to size or magnitude

    New Auto-Interp
    Negative Logits
     classNames
    -0.66
    epiece
    -0.65
     sanitarias
    -0.63
     alluminio
    -0.61
     professionale
    -0.61
     poésie
    -0.61
     estatales
    -0.61
     esterna
    -0.60
    titian
    -0.60
     során
    -0.59
    POSITIVE LOGITS
     big
    2.73
     huge
    2.15
    big
    2.01
     biggest
    1.90
     bigger
    1.88
     BIG
    1.77
    huge
    1.75
     HUGE
    1.72
     large
    1.72
    biggest
    1.66
    Act Density 0.049%

    No Known Activations