INDEX
Explanations
references to increasing or elevating actions or states
New Auto-Interp
Negative Logits
iyle
-0.79
endregion
-0.78
]--;
-0.77
BEA
-0.73
DebuggerNonUser
-0.73
STC
-0.72
Criterion
-0.72
MDC
-0.71
SSC
-0.71
Modi
-0.70
POSITIVE LOGITS
up
1.82
Up
1.77
UP
1.74
Up
1.61
up
1.46
UP
1.44
AsUp
1.40
down
1.24
ups
1.22
Ups
1.20
Activations Density 0.123%