INDEX
Explanations
verbs related to ability, capability, or possibility
New Auto-Interp
Negative Logits
Parents
-0.69
Tradition
-0.62
-0.60
aternity
-0.60
rium
-0.58
friend
-0.57
adish
-0.57
Nose
-0.56
dress
-0.56
dad
-0.54
POSITIVE LOGITS
bodied
1.01
ioned
0.95
to
0.76
ta
0.76
Reviewer
0.75
bod
0.74
reys
0.72
compe
0.69
simultane
0.69
Osw
0.67
Activations Density 1.517%