INDEX
Explanations
expressions of gratitude and moral conflict
Negative Logits
renheit        -0.82
Tonight        -0.74
click          -0.74
Exit           -0.69
Tomorrow       -0.62
agon           -0.62
Gran           -0.61
grass          -0.61
hog            -0.60
Maker          -0.60
Positive Logits
ensured         0.83
merce           0.81
disclaim        0.78
contend         0.76
incentiv        0.73
conspic         0.73
ItemTracker     0.71
displayText     0.71
noted           0.71
acknowledge     0.69
Activations Density 0.254%