INDEX
Explanations
phrases related to concentration or prioritization
repeated instances of the word "focus."
New Auto-Interp
Negative Logits
idden
-0.70
named
-0.67
added
-0.65
mia
-0.63
ania
-0.62
eries
-0.61
Gleaming
-0.61
ston
-0.61
attest
-0.61
OUGH
-0.60
POSITIVE LOGITS
rite
0.90
focus
0.86
Focus
0.83
peed
0.81
starter
0.80
rals
0.80
focus
0.79
attention
0.77
Focus
0.77
focusing
0.77
Activations Density 0.024%