Explanations
phrases related to standing up against challenges or obstacles
Negative Logits
Token        Logit
kson         -0.70
ataka        -0.67
ades         -0.61
alez         -0.59
captcha      -0.58
oldown       -0.58
iche         -0.58
aceutical    -0.57
tymology     -0.56
retaining    -0.56
Positive Logits
Token        Logit
ered         1.17
biz          1.17
alter        1.05
manship      0.95
runners      0.93
rooms        0.93
downs        0.91
case         0.86
cases        0.84
room         0.82
Activations Density 0.506%