INDEX
Explanations
expressions of thought or contemplation
"think" or "thinking"
thinking about or asking about
Negative Logits
Token       Logit
Guill       -0.47
»           -0.46
Kob         -0.44
«           -0.44
<eos>       -0.42
я           -0.41
»           -0.41
«           -0.40
Jacobsen    -0.40
Quig        -0.40
Positive Logits
Token       Logit
THINK       1.74
think       1.74
Think       1.73
Think       1.72
think       1.68
THINK       1.59
thinks      1.54
thinking    1.41
Thinking    1.36
thinking    1.34
Activations Density 0.179%
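The page does not say how the density figure is computed. A common reading, assumed here, is that it is the fraction of token positions on which the feature's activation is above zero. The sketch below illustrates that reading only; the function name `activation_density`, the `threshold` parameter, and the sample data are hypothetical and not taken from the dashboard.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of token positions whose activation exceeds threshold.

    Assumption: "density" means the share of positions where the feature fires.
    """
    if not activations:
        raise ValueError("need at least one activation value")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical data: 1,000 token positions, 2 of which activate the feature.
sample = [0.0] * 998 + [1.3, 0.7]
print(f"{activation_density(sample):.3%}")  # prints 0.200%
```

Under this reading, a density of 0.179% would mean the feature fires on roughly 18 of every 10,000 tokens, which fits a feature that responds narrowly to forms of "think".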