INDEX
Explanations
phrases related to observation and insight
phrases indicating perception or observation
New Auto-Interp
Negative Logits
ufact
-0.75
isure
-0.73
ensured
-0.72
assisted
-0.70
inqu
-0.67
keeping
-0.65
arist
-0.64
eta
-0.63
ensures
-0.63
fact
-0.63
POSITIVE LOGITS
resemblance
1.11
similarities
1.09
difference
1.04
similarity
0.97
sunrise
0.96
parallels
0.96
signs
0.95
devastation
0.95
reflection
0.91
positives
0.90
Activations Density 0.175%