INDEX
Explanations
The neuron is looking for words related to facts or issues
words related to functionality and practicality in various contexts
New Auto-Interp
Negative Logits
ãģ¯
-0.73
CHAT
-0.69
Veter
-0.68
Sav
-0.68
âĸĪâĸĪâĸĪâĸĪ
-0.68
Frameworks
-0.67
ARDS
-0.66
ðĿ
-0.66
================
-0.66
YouTube
-0.65
POSITIVE LOGITS
ional
1.40
acia
0.91
ially
0.89
terness
0.85
ities
0.83
brill
0.82
ised
0.82
ism
0.81
sized
0.80
glances
0.79
Activations Density 0.010%