INDEX
Explanations
expressions of thought and questioning
New Auto-Interp
Negative Logits
propOrder
-0.46
LLocation
-0.35
intios
-0.34
见状
-0.33
ttemberg
-0.33
XtraReports
-0.32
styleType
-0.32
WebVitals
-0.32
AttributeSet
-0.32
AnchorStyles
-0.32
POSITIVE LOGITS
thinking
3.06
thought
2.81
thinking
2.73
Thinking
2.70
Thinking
2.67
thoughts
2.63
thought
2.50
think
2.47
THINK
2.45
Think
2.39
Activations Density 0.632%