INDEX
Explanations
phrases related to attention or emphasis on a particular subject or object
New Auto-Interp
Negative Logits
named
-0.73
OUGH
-0.73
ania
-0.73
mia
-0.69
ston
-0.66
mx
-0.62
BIT
-0.61
Haunted
-0.61
adding
-0.61
asca
-0.61
POSITIVE LOGITS
focus
1.02
rite
0.95
focused
0.93
focuses
0.89
foc
0.89
squarely
0.85
focusing
0.85
focus
0.83
peed
0.83
phasis
0.82
Activations Density 0.717%