INDEX
Explanations
positive adjectives describing something favorable or beneficial
phrases that express positive evaluations or affirmations regarding various subjects
New Auto-Interp
Negative Logits
racuse
-0.79
ategory
-0.79
opers
-0.73
ipient
-0.73
hyde
-0.71
ancies
-0.70
eters
-0.69
olson
-0.67
ppe
-0.66
ardy
-0.66
POSITIVE LOGITS
enough
1.05
enough
0.88
bye
0.81
fodder
0.80
consolation
0.78
synergy
0.78
news
0.77
surpr
0.77
luck
0.76
Enough
0.76
Activations Density 0.114%