INDEX
    Explanations

    the word "again" at different levels of activation

    New Auto-Interp
    Negative Logits
    تفصیلات
    -0.60
     kog
    -0.58
     kard
    -0.56
     uhr
    -0.53
     meras
    -0.52
     buone
    -0.51
    -0.51
    Transcrip
    -0.51
     sembla
    -0.51
     vira
    -0.50
    POSITIVE LOGITS
     schoolmaster
    0.89
     wanderer
    0.82
     parson
    0.78
     redhead
    0.76
     gladiator
    0.75
     indestru
    0.74
     pamph
    0.74
     countryman
    0.73
     steamboat
    0.73
     rascal
    0.73
    Act Density 0.072%

    No Known Activations