INDEX
Explanations
phrases related to doing one's best or making an effort
phrases expressing capability or potential actions
Negative Logits
Lis       -0.65
Politics  -0.63
Cance     -0.63
Mour      -0.63
Passage   -0.63
Falk      -0.63
Ez        -0.62
Prin      -0.62
Rising    -0.61
Falling   -0.61
Positive Logits
berra       1.05
't          1.04
muster      1.04
feas        0.93
adian       0.86
reasonably  0.84
nesota      0.82
afford      0.81
iary        0.79
NOT         0.79
Activations Density: 0.094%