INDEX
Explanations
conversational phrases that express uncertainty or questioning, particularly in collaborative or decision-making contexts
complex analysis and explanation
The neuron activates on content-bearing, informative tokens (important nouns, verbs, adjectives, and discourse-focus words) rather than on function words.
Negative Logits
مرئيه    -0.71
Бахар    -0.68
BeginContext    -0.66
lenker    -0.63
المعيارى    -0.56
contentLoaded    -0.54
ᅠ    -0.52
ſind    -0.50
styleType    -0.50
mergeFrom    -0.50
Positive Logits
fromnode    0.47
复杂的    0.40
complex    0.39
complexo    0.37
thoughtful    0.36
复杂    0.36
compleja    0.36
active    0.36
ftagPool    0.34
Activités    0.33
Activations Density 0.110%
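
As a rough illustration of how a density figure like the one above can be read, the sketch below assumes that activation density means the fraction of sampled tokens on which the feature fires above a threshold. That definition, the function name activation_density, the threshold default, and the sample data are all assumptions made for illustration; none of them come from the dashboard itself.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of tokens whose activation exceeds `threshold`.

    Assumed definition: density = (# active tokens) / (# tokens sampled).
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical sample: 11 active tokens out of 10,000 sampled tokens.
sample = [0.0] * 9989 + [1.5] * 11
density = activation_density(sample)
print(f"{density:.3%}")  # prints "0.110%"
```

Under this reading, a density of 0.110% would mean the feature fires on roughly 11 of every 10,000 tokens, which is typical of a sparse feature.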