INDEX
Explanations
verbs related to attempts or efforts
instances of the word "trying."
New Auto-Interp
Negative Logits
dylib
-0.78
cised
-0.70
lav
-0.63
friends
-0.62
anon
-0.61
thus
-0.60
oola
-0.60
DragonMagazine
-0.59
ros
-0.59
ifa
-0.58
POSITIVE LOGITS
unsuccessfully
1.06
desperately
0.92
harder
0.85
vain
0.76
reprene
0.72
ioned
0.71
ichick
0.70
valiant
0.69
frantically
0.67
wark
0.67
Activations Density 0.044%