INDEX
Explanations
The neuron is looking for phrases indicating uncertainty or contemplation about a concept or idea
phrases that express the concept of "what it is."
New Auto-Interp
Negative Logits
teness
-0.84
Extend
-0.68
oso
-0.67
Shall
-0.65
ichever
-0.63
Refresh
-0.61
Hitch
-0.60
locks
-0.58
uish
-0.57
pend
-0.57
POSITIVE LOGITS
supposed
0.84
nt
0.80
olate
0.76
Ĥİ
0.75
meant
0.75
actually
0.73
going
0.73
akin
0.72
intrinsically
0.72
presently
0.72
Activations Density 0.321%