INDEX
Explanations
instances of emphasis or importance
instances of the word "even."
New Auto-Interp
Negative Logits
rend
-0.88
aim
-0.82
hammad
-0.72
Exit
-0.71
unity
-0.71
scribe
-0.70
cel
-0.68
aye
-0.68
asy
-0.66
main
-0.66
POSITIVE LOGITS
remotely
0.93
though
0.91
tho
0.90
handedly
0.89
joked
0.78
hinted
0.76
handed
0.75
briefly
0.74
managed
0.72
entertained
0.70
Activations Density 0.056%