INDEX
    Explanations

    phrases containing the word "try" followed by an action

    instances of attempts or efforts to try something new or experimental

    Negative Logits
    Token             Logit
    " Deaths"         -0.72
    "shire"           -0.67
    "Journal"         -0.67
    "ifa"             -0.66
    " Klu"            -0.63
    "ufact"           -0.63
    " Printed"        -0.62
    "Scotland"        -0.61
    " concerns"       -0.61
    "SOURCE"          -0.61
    Positive Logits
    Token                 Logit
    " harder"             0.93
    " unsuccessfully"     0.89
    " hardest"            0.88
    "unal"                0.81
    "ocre"                0.80
    " experiment"         0.78
    " trick"              0.78
    " patience"           0.76
    "icide"               0.72
    "onz"                 0.69
    Activation Density: 0.083%

    No Known Activations