INDEX
Explanations
phrases expressing uncertainty or probability
Negative Logits
ValueGenerated    -0.56
کلی               -0.52
Paglinawan        -0.49
daglig            -0.48
UPO               -0.48
chrétienne        -0.46
specchio          -0.45
attiva            -0.45
PMID              -0.45
rivol             -0.44
Positive Logits
Possibly          1.63
perhaps           1.62
maybe             1.60
Possibly          1.59
Perhaps           1.59
perhaps           1.58
Perhaps           1.55
Maybe             1.52
possibly          1.51
Vielleicht        1.50
Activations Density 0.260%