INDEX
Explanations
instances of significant change or transition in roles or circumstances
Negative Logits
kud      -0.15
ungan    -0.15
kdir     -0.14
dbe      -0.14
664      -0.14
orthy    -0.14
MEA      -0.14
itchen   -0.13
inux     -0.13
าน       -0.13
Positive Logits
decision     0.22
Decision     0.18
decision     0.18
Simply       0.17
reasons      0.16
Too          0.16
conclusion   0.16
Decision     0.16
decided      0.15
Simply       0.15
Activations Density 0.195%