INDEX
Explanations
This neuron detects when the text is asking about or stating “relevance,” i.e. it flags relevance-evaluation language.
New Auto-Interp
Negative Logits
áreas
-0.07
ега
-0.07
ınıza
-0.06
experiencia
-0.06
pools
-0.06
.dp
-0.06
_front
-0.06
aland
-0.06
CEEDED
-0.06
.IndexOf
-0.06
POSITIVE LOGITS
(pipe
0.07
ृत
0.07
เศ
0.06
exagger
0.06
landlord
0.06
.ipv
0.06
जन
0.06
]));
0.06
(',');↵0.06
zpráva
0.06
Activations Density 0.020%