INDEX
Explanations
the ability to take actions or perform tasks
New Auto-Interp
Negative Logits
springfox
-0.66
hdashline
-0.65
OrNil
-0.64
onCancelled
-0.63
psum
-0.62
mockMvc
-0.58
GenerationType
-0.58
MCD
-0.58
רושלים
-0.57
pstead
-0.57
POSITIVE LOGITS
able
1.02
Able
0.92
Able
0.83
bodied
0.80
ability
0.65
bodied
0.61
Ability
0.60
abilities
0.58
Abilities
0.58
to
0.58
Activations Density 0.055%