INDEX
Explanations
phrases containing the word "try" followed by an action
instances of attempts or efforts to try something new or experimental
New Auto-Interp
Negative Logits
Deaths
-0.72
shire
-0.67
Journal
-0.67
ifa
-0.66
Klu
-0.63
ufact
-0.63
Printed
-0.62
Scotland
-0.61
concerns
-0.61
SOURCE
-0.61
POSITIVE LOGITS
harder
0.93
unsuccessfully
0.89
hardest
0.88
unal
0.81
ocre
0.80
experiment
0.78
trick
0.78
patience
0.76
icide
0.72
onz
0.69
Activations Density 0.083%