INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    highly
    0.62
    Large
    0.61
     verschiedenen
    0.60
     다양한
    0.60
     belirli
    0.60
    large
    0.59
     birbirinden
    0.58
     karşınız
    0.57
    不同
    0.57
    Ancient
    0.56
    POSITIVE LOGITS
     entrepreneurs
    1.26
     economists
    1.19
     physicists
    1.17
     journalists
    1.16
     astronomers
    1.13
     activists
    1.13
     engineers
    1.12
     marketers
    1.10
     researchers
    1.10
     scientists
    1.09
    Act Density 0.230%

    No Known Activations