INDEX
Explanations
No clear pattern is discernible because the neuron never activates on the provided text, so its feature is undetermined from these examples.
Negative Logits
fhew             -0.60
principalColumn  -0.60
kasarigan        -0.59
RefNanny         -0.59
purpoſe          -0.57
Anſ              -0.56
anſ              -0.52
Personensuche    -0.52
InputDecoration  -0.52
civilisation     -0.52
Positive Logits
written      0.45
不禁         0.38
surla        0.36
pyplot       0.36
Италијани    0.34
CardModule   0.32
(`           0.31
statechange  0.31
geschrieben  0.30
verhe        0.30
Activations Density: 0.000%
No Known Activations
This feature has no known activations.