INDEX
Explanations
conjunctions like 'and' or 'or' connecting phrases with contrasting effects
comparative phrases discussing the interplay of positive and negative aspects
New Auto-Interp
Negative Logits
£ı
-0.95
ĸļ
-0.95
eday
-0.81
ItemThumbnailImage
-0.79
ESE
-0.75
Ħ¢
-0.73
monary
-0.72
20439
-0.71
dayName
-0.71
ģĸ
-0.70
POSITIVE LOGITS
bad
1.19
evil
1.05
bad
1.05
worst
1.04
drawbacks
1.02
downside
1.01
disapproval
1.01
disappoint
1.00
unpleasant
0.99
harmful
0.98
Activations Density 0.271%