INDEX
Explanations
discussions related to confusion and verification in communication
Negative Logits
Token              Logit
PREFERRED          -0.59
setopt             -0.55
Nachteile          -0.53
حاضر               -0.51
parha              -0.48
ElementException   -0.46
جغرافيا            -0.45
assoluto           -0.45
Preferred          -0.44
life               -0.44
Positive Logits
Token              Logit
maybe              1.23
Maybe              1.21
Maybe              1.15
Perhaps            1.12
maybe              1.09
perhaps            1.08
Perhaps            1.02
Possibly           1.02
perhaps            1.02
strange            0.99
Activations Density: 0.457%