INDEX
    Explanations

    words related to details or specific features in a context of a narrative or instructions

    New Auto-Interp
    Negative Logits
     brune
    -0.98
     alkoh
    -0.97
     depic
    -0.97
     McLaugh
    -0.94
     rigide
    -0.92
     fré
    -0.91
     apprehen
    -0.90
     silikon
    -0.90
     jette
    -0.88
     intersper
    -0.88
    POSITIVE LOGITS
     provides
    0.97
     allows
    0.95
     creates
    0.94
     does
    0.93
     gets
    0.93
     makes
    0.93
     gives
    0.92
     doesn
    0.90
     generates
    0.88
     goes
    0.87
    Act Density 0.541%

    No Known Activations