INDEX
Explanations
instances where someone is successful in achieving a goal or solving a problem
phrases indicating capability or successful achievements
New Auto-Interp
Negative Logits
Parents
-0.73
rium
-0.67
Tradition
-0.63
-0.62
Nose
-0.59
laundry
-0.57
VIDEOS
-0.56
machine
-0.56
Flags
-0.55
Kendall
-0.55
POSITIVE LOGITS
bodied
1.02
't
0.84
ioned
0.84
Reviewer
0.77
reys
0.76
istically
0.75
afford
0.73
bod
0.71
compe
0.70
untarily
0.68
Activations Density 0.040%