Explanations
phrases related to assessing or questioning the effectiveness of a certain action or program
terms related to effectiveness and efficacy in various contexts
Negative Logits
pper      -0.75
hak       -0.71
Rail      -0.71
ODE       -0.70
Pic       -0.68
Earth     -0.68
cise      -0.66
saw       -0.64
nee       -0.64
Barrel    -0.64
Positive Logits
effectiveness    1.14
iveness          1.11
destro           0.97
acies            0.91
confir           0.90
iencies          0.85
tremend          0.85
guiActiveUn      0.83
eatures          0.83
olicy            0.82
Activations Density: 0.009%