Explanations
language suggesting the potential to improve operations or processes
phrases indicating capability or potential
Negative Logits
Fighter      -0.68
edient       -0.65
rehearsal    -0.64
striving     -0.62
Generation   -0.62
honoring     -0.61
revision     -0.60
guarding     -0.60
Moz          -0.59
Gadget       -0.59
Positive Logits
't           1.48
adian        1.23
berra        1.18
NOT          1.03
vas          0.98
attest       0.94
tera         0.92
isters       0.90
thus         0.87
afford       0.84
Activations Density: 0.176%