Explanations
phrases emphasizing advantages or benefits in various contexts
Negative Logits
Token        Logit
sillon       -0.61
izumi        -0.60
StructField  -0.60
Mun          -0.60
utuhkan      -0.59
ifilm        -0.59
letzt        -0.59
Atem         -0.59
</i>         -0.59
eryn         -0.59
Positive Logits
Token          Logit
Advantage      1.44
Advantage      1.36
advantage      1.35
advantages     1.35
advantage      1.35
ANTAGE         1.34
disadvantage   1.34
disadvantages  1.29
Advantages     1.27
Advantages     1.26
Activations Density: 0.095%