INDEX
Explanations
adjectives and nouns related to negative situations
phrases indicating potential outcomes or consequences
Negative Logits
Downloadha   -0.81
ilian        -0.72
claim        -0.71
agree        -0.70
ystem        -0.67
emon         -0.67
urus         -0.66
allegedly    -0.63
formerly     -0.63
obia         -0.63
Positive Logits
boon         1.00
next         0.98
tomorrow     0.95
someday      0.92
sooner       0.86
soon         0.84
beneficiary  0.77
forever      0.77
fruitful     0.77
2020         0.77
Activations Density: 0.246%