INDEX
    Explanations

    information related to personal experiences and interactions

    phrases related to personal experiences and observations

    New Auto-Interp
    Negative Logits
    ]."
    -0.67
    )].
    -0.66
    .</
    -0.64
    %.
    -0.63
    ?".
    -0.62
    ]).
    -0.61
    ));
    -0.61
    )).
    -0.59
    NetMessage
    -0.56
    .''
    -0.56
    POSITIVE LOGITS
     travelled
    0.61
     Nottingham
    0.56
     Downing
    0.55
    ortmund
    0.55
    ETHOD
    0.53
     replaced
    0.52
     Rutherford
    0.52
     confirmed
    0.52
    arij
    0.51
     booted
    0.51
    Act Density 1.534%

    No Known Activations