    Explanations

    phases of projects/studies

    Negative Logits
    Token        Logit
    " phase"     -1.80
    "phase"      -1.68
    "Phase"      -1.43
    "PHASE"      -1.42
    " PHASE"     -1.41
    " Phase"     -1.39
    " phases"    -1.34
    "phases"     -1.33
    " fase"      -1.13
    " Phases"    -1.13
    Positive Logits
    Token        Logit
    "wa"         0.45
    "хова"       0.43
    "pa"         0.43
    " anla"      0.42
    "Sna"        0.42
    " swimmer"   0.42
    "med"        0.42
    "als"        0.42
    " kissing"   0.41
    "hea"        0.40
    Activation Density: 0.011%

    No Known Activations