INDEX
Explanations
celebrities
The main thing this neuron does is detect mentions of notable people’s names (celebrities, politicians, and other proper-name entities).
New Auto-Interp
Negative Logits
priest
-0.07
банк
-0.07
frustr
-0.07
ovi
-0.07
finals
-0.06
베스트
-0.06
στή
-0.06
initiative
-0.06
ブリ
-0.06
遺
-0.06
POSITIVE LOGITS
.ReadKey
0.07
.getenv
0.07
▏
0.07
.printf
0.06
.Down
0.06
gles
0.06
.Filters
0.06
.getAction
0.06
vaping
0.06
.setStyleSheet
0.06
Activations Density 0.035%