INDEX
Explanations
phrases conveying positive sentiment or approval
positive affirmations or commendatory phrases
New Auto-Interp
Negative Logits
Downloadha
-0.74
abel
-0.74
minent
-0.72
ËĪ
-0.69
IDES
-0.69
Strauss
-0.69
agin
-0.68
actionDate
-0.68
resent
-0.67
URA
-0.65
POSITIVE LOGITS
job
1.18
luck
1.12
bye
1.08
timing
1.06
thing
1.03
grief
0.96
idea
0.94
heavens
0.94
gracious
0.92
bye
0.91
Activations Density 0.093%