INDEX
Explanations
phrases related to shifting or maintaining a focus
New Auto-Interp
Negative Logits
ston
-0.71
OUGH
-0.69
paradise
-0.68
Haunted
-0.66
apons
-0.64
Emirates
-0.61
lyn
-0.61
mia
-0.60
attest
-0.60
Casino
-0.60
POSITIVE LOGITS
focus
0.94
peed
0.89
attention
0.89
focuses
0.87
Attention
0.86
focused
0.86
focus
0.84
rite
0.82
squarely
0.81
phasis
0.78
Activations Density 2.167%