INDEX
Explanations
references to the strength or intensity of different aspects
the repeated emphasis on the term "the" in various contexts, indicating a focus on central themes or subjects in discussions
New Auto-Interp
Negative Logits
icia
-0.84
lance
-0.84
imity
-0.81
erity
-0.81
gob
-0.78
fficiency
-0.76
fulness
-0.76
worth
-0.76
earance
-0.76
frog
-0.75
POSITIVE LOGITS
situation
0.94
profession
0.93
relationship
0.92
cosmos
0.92
universe
0.91
equation
0.88
underlying
0.86
respective
0.83
corpus
0.82
environment
0.81
Activations Density 0.240%