INDEX
Explanations
phrases related to being the central focus of attention or discussion
occurrences of the phrase "subject of" followed by various topics or controversies
New Auto-Interp
Negative Logits
uncond
-0.70
frog
-0.65
nephew
-0.62
Americas
-0.60
chairs
-0.59
yeast
-0.59
coping
-0.59
Pegasus
-0.59
rha
-0.58
subsystem
-0.57
POSITIVE LOGITS
ENTION
1.00
ridicule
0.99
ire
0.92
scorn
0.86
actionDate
0.78
urst
0.74
mockery
0.73
spection
0.72
attention
0.71
stares
0.70
Activations Density 0.099%