Explanations
mentions of specific details or facts within a broader context
instances of the phrase "noting that."
Negative Logits
Token     Logit
Escape    -0.56
live      -0.56
angle     -0.55
als       -0.55
ular      -0.54
lives     -0.54
Hack      -0.54
oi        -0.53
cles      -0.52
ug        -0.52
Positive Logits
Token          Logit
noting         3.21
mentioning     2.04
citing         1.86
stating        1.84
emphasizing    1.83
acknowledging  1.83
stressing      1.72
pointing       1.71
noticing       1.67
estimating     1.58
Activations Density: 0.009%