INDEX
    Explanations

    the word "gar" with varying levels of activation

    the word "gar" in various contexts

    New Auto-Interp
    Negative Logits
    ivity
    -0.74
    anwhile
    -0.74
    vironment
    -0.72
    psey
    -0.71
    gdala
    -0.70
     reckoning
    -0.70
    orer
    -0.70
    terday
    -0.69
     bargaining
    -0.69
     constitu
    -0.68
    POSITIVE LOGITS
    gar
    1.01
    rets
    0.90
    bage
    0.89
    rier
    0.86
    neau
    0.84
    zik
    0.81
    rics
    0.79
    lean
    0.79
    rett
    0.78
    rius
    0.78
    Act Density 0.005%

    No Known Activations